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Chapter 1 

THE IMPORTANCE OF HARMONIC TONES AND METHODS 
FOR THEIR SYNTHESIS AND CONTROL 

1.1 Introduction 

During the past three years, I have investigated the use of certain new 
and flexible forms of analog circuitry for the generation of sound materials 
to be used in electronic music, for electronic musical instruments, and for 
studies in musical sound perception. In this investigation I have stressed the 
synthesis and control of the timbre of harmonic tones (tones which consist of 
harmonic partials). Several special devices which were constructed for this 
investigation are the following: 

.(1) The Harmonic Tone Generator . This electronic device employs additive syn¬ 
thesis to produce at its output a harmonic tone with its fundamental frequency 
and harmonic amplitudes independently variable according to the values of sev¬ 
eral input parameter control voltages. The design of this device is discussed 
in Chapter 2. 

(2) Six Time-Envelope Networks . A voltage step at the input of one of these 
networks produces a time-varying DC voltage output which is used primarily 
for building up attack and decay envelopes for musical sounds. 

(3) Six Time-Delay Gate Networks . A voltage step at the input of one of these 
networks produces another voltage step at the output which is time-delayed with 
respect to the input by a pre-set amount (variable m three ranges from 0 to 5 
seconds). However, the output returns to zero when the input does. Also, by 
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means of a separate part of the circuit an external signal can be turned on and 
off in synchronism with the output. 

(4) Keyboard . A keyboard was wired with a set of variable resistors in conjunc¬ 
tion with a reference voltage which produces a pitch-control voltage corresponding 
to the key depressed. 

(5) AC-DC Converter . A high frequency tone burst (5KC or greater) at the input 
of this circuit produces a DC output which can be used to drive the Time-Enve¬ 
lope Networks or the Time-Delay Gate Networks. Hence, tone bursts recorded on 
magnetic tape according to a rhythmic scheme can be used to actuate these net¬ 
works . 

(6) Analog Multiplier . The output voltage of this circuit is proportional to 
the product of two input voltages which may be of either polarity. Although the 
multiplier was designed primarily for synthesis of special parameter control 
voltages, its wide frequency range (10KC) and its wide dynamic range on one input 
(75 db), permit it to be used as a balanced AM modulator (or "ring" modulator) 

or for envelope control of arbitrary signals. 

Devices (2) through (6) are discussed in Chapter 3. 

(7) Variable Formant Filter . The output of this filter is the sum of the atten¬ 
uated outputs of twelve band pass filters with band-centers logarithmically 
spaced between 189 cps and 2400 cps. The transfer response of this filter is 
capable of assuming a large number of distinct shapes corresponding to the settings 
of twelve attenuator controls. Hence, by subtractive synthesis, a wide variety 

of output spectra can be developed from wide-band flat spectrum inputs such as 
white noise and pulse waves. Curves for the design of variable formant filters 
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according to a "best” criterion for up to 10 filters per octave and three basic 
transfer functions are derived in Chapter 4. 

These devices are now installed at the University of Illinois Experimental 
Music Studio. 1 The overall philosophy of the design of this system of devices 
for generating musical sounds is similar to that applied in the design of an 
analog computer. First, m the case of a computer, electrical signals (to be 
converted into sound signals) are created in this system by means of a set of 
well-defined electrical generators and converters which can in turn be connected 
together in a very large number of possible combinations. Second, DC voltage 
signals which represent simple mathematical functions (a basic characteristic 
of analog computers) are used extensively for the specification and control of 
musical parameters such as amplitude, pitch, and tone color. However, in a 
music system such as this, the absolute phase of an oscillator output is usually 
indeterminate and feedback is seldom employed or required, m contrast to the 
ordinary use of an analog computer. In this music system, the output which is 
to be synthesized is specified a priori as an explicit function of time and of 
the defining parameters. On the other hand, with an analog computer the output, 
which is a solution to a problem, is only specified as an implicit function of 
these variables . Another important difference is that signal bandwidths needed 
for music (20,000 cps for output signals and 1,000 cps for parameter control) 
are generally much higher than those ordinarily required in analog computers 
(about 50 cps). 

The use of DC control voltages permits the continuous control of important 

1. Hiller, L.A. Jr.: "An Integrated Electronic Music Console", J. Audio Eng. 

Soc., 13(2), 142-150, 1965. ~ 
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musical parameters within a single sound event. This is a very important concept 
which now can be exploited in electronic music; until just recently this concept 
has been lacking in practically all electronic tape music. However, within the 
past year and a half several compositions have been completed at the Experimental 
Music Studio which utilize the Harmonic Tone Generator and other devices in 
varying degrees. Among them are "Futility 1964" by Herbert Brun, "Machine Music" 
by Lejaren Hiller; "Underworld" by Salvatore Martirano; and "Antiphonics III", 
"Lemon Drops" and incidental music for "Hydrogen Jukebox" (a play) by Kenneth 
Gaburo. "Lemon Drops" is an electronic tape composition which was created almost 
entirely from sounds produced by the Harmonic Tone Generator. 

In Chapter 5, I indicate how the electronic method of additive harmonic 
synthesis, which utilizes a carrier frequency technique, can be extended for 
the analysis of a sound in terms of the parameters required to define it as a 
harmonic tone. Three mathematically simple versions of harmonic analysis by 
time-variant Fourier series calculations are presented and compared. These re¬ 
sults are applicable to the electronic analysis system, since this system is an 
analog of the time-variant Fourier series method. 

Finally, the sound generation system, which comprises the seven devices 
listed above, can also be used for studies in sound perception. The results of 
one such study, using the Harmonic Tone Generator, are given in Chapter 6. 

This was an attempt to determine the intensity of a single harmonic, in the pre¬ 
sence of the set of lower harmonics, required for the pitch of this harmonic to 
be perceived as distinct from the pitch of the composite harmonic tone. The re¬ 
sults indicate that for the first six harmonics the average level required is 
about 20 phons (subjective intensity units) below the combined loudness of the 



5 


lower harmonics, independent of fundamental frequency from 65 to 520 cps and 
independent of harmonic number. 

1.2 Single Sounds and Harmonic Tones in Music 

Music is basically an organization of sounds within a frequency domain and 
a time domain framework. The ’’message" contained in a piece of music almost al¬ 
ways has no concrete referential meaning. Nevertheless, the composer of the 
music means that a sense of multifarious structure be conveyed to the listener, 
and that he be capable (at least subconsciously) of aurally decomposing the 
sound into simpler constituent parts, making a variety of associations among 
them, and deriving enjoyment from this process. In conventional music, these 
constituent parts are usually recognized as either "single sounds" or'ensemble 
sounds" which are composed of single sounds. At times the distinction between 
these two types of sounds may be quite subtle. Experience in various electronic 
music studios has shown that if the sound of several instruments simultaneously 
playing a chord is altered by removing the attack portion of the chord, the 
aural effect is more like that of a complex single sound. However, when all the 
aural clues are present, especially with respect to differences in time and 
space location, the musically trained person has no trouble separating all the 
single sounds present in a complex musical texture. In fact, with experience 
he can learn to construct a reasonably accurate score from repeated listenings 
to the music. 

By elimination, we are left with the definition of a single sound as being 
a sound which cannot readily be decomposed by a listener into simpler parts. 
Sounds of this type not only include the tones of the common wind, string, and 
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percussion musical instruments-but also many sounds in nature and human speech. 

We are apt, due to our experience, to think of a single sound as that which 
emanates from a single object or spatial location. However, in the case of elec¬ 
tronically generated sound this generalization does not hold at all since it is 
quite easy to reproduce the sound of a full orchestra through a single loudspeaker 
or, on the other hand, to split a single sound into parts and generate them from 
several different spatially separated loudspeakers. Characteristics which prevent 
the listener from aurally diagnosing the frequency spectrum of a single sound 
are: (1) Lack of distinction between the times of entrance of the various fre¬ 
quency components; (2) Small frequency separation between discrete components 
or no separation (continuous frequency spectrum); (3) Harmonic relationship of 
components. 

A "tone" can be defined to be a single sound with a definite subjective 
pitch. This pitch generally results from the predominant concentration of spec¬ 
tral energy at a single position in the frequency domain or from the definite¬ 
ness of a certain time period of repetition. Harmonic tones, which are composed 
of frequencies integrally related to some audible fundamental frequency, are 
of particular importance in mu6ic. Under most conditions the subjective musical 
pitch of a harmonic tone is with good approximation proportional to the log¬ 
arithm of its fundamental frequency (whether or not the fundamental is actually 
present in the sound) . For a given pitch the timbre is determined by the relative 
intensities and apparently to a much lesser degree, the phases of the constituent 
harmonic frequencies. The forced vibration tones of the ordinary acoustical 
musical instruments, executed by bowing strings and blowing into tubes, are har¬ 
monic to a considerable degree. So are the damped vibration tones of certain 
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percussion instruments such as the piano and the guitar. For centuries, harmonic 
tones have been the predominant single sounds used in music of the Western world, 
and because of the definite scalar ordering of the pitches of these tones, and 
the ease of producing all audible frequencies, irrespective of timbre, it is 
not surprising that the theory of pitch organization has reached a high stage 
of development. However, since there exists no obvious ordering of timbre, and, 
until recently, there has been no method of systematically and independently 
varying the parameters which define it, theories of timbre organization are prac¬ 
tically nonexistent. Therefore, the composer of timbre relationships has been 
forced to rely upon pragmatic or a prion judgments, and consequently his 
ordering of timbres has been for the most part restricted to making "differences" 
among them. 

1.3 The Complexity of Musically "Rich" Harmonic Tones 

The question of what constitutes the timbre of a harmonic tone is even 
more complex than determining the effect of different relative intensities and 
phases of the harmonics and the fundamental frequency. There exists an over¬ 
whelming amount of evidence to testify that on the level of sophistication of 
sound production provided by the conventional musical instruments, the "richness" 
of tones depends on the variation of these parameters with respect to time. That 
is to say, the precise manner in which the harmonic intensities build up to their 
final average values at the start of a tone, the variations of these intensities 
during the course of the tone, and other factors such as additive noise, pitch 
deviation, and small amounts of frequency and amplitude modulation also play a 
major role in determining the timbre characteristics of a tone and give it a 
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certain "warmth" which is lacking in less complex electronically generated tones. 

Five musical terms commonly used to describe timbre are: pitch, tone quality, 
attack, vibrato, and tremolo. These refer technically to the fundamental fre¬ 
quency, the average steady-state harmonic spectrum, the start of the tone, and 
regularly fluctuating changes of pitch and amplitude. However, they are over¬ 
simplified descriptions of what may be very complex phenomena. For instance, the 
musician refers to a "soft," "dull," "brillant," or "nasal" tone quality, while 
being quite unaware of the type of the harmonic spectrum which produces this 
effect. Certainly, at a fixed pitch and to a first-order approximation tone 
quality must be determined by the average relative levels of the harmonics. But 
whether or not it is possible to perceive tone quality as a fixed parameter over 
a significant pitch range, and, if so, how the harmonic distribution must vary 
(if it must at all) with respect to pitch in order to keep tone quality constant, 
are questions which cannot be answered with certainty at the present time. 

Nevertheless, it is a well-known fact that the tone qualities of many mu¬ 
sical instruments depend on "formant" characteristics. These are frequency re¬ 
sponse curves usually displaying one or more definite resonances or anti-res¬ 
onances. In this case, the individual levels of the harmonics change with pitch 
and are proportional to the corresponding values of a formant characteristic at 

each of the harmonic frequencies. The formant curves for many of the nonpercussive 

2 

musical instrument tones have been measured in detail by Luce. The tone of a 
bassoon is a good example of a sound which is easily recognized in terms of a 
simple formant characteristic, a resonance at about 500 cps. Recognition of 

2. Luce, D.A.: "Physical Correlates of Nonpercussive Musical Instrument Tones," 
unpublished Doctoral Dissertation, M.I.T., 1963, pp. 225-323. 
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synthetic bassoon tones is enhanced if the pitch is varied, probably .because 
the resonance region is defined more accurately by the resulting movement of 
the harmonics within the boundary of the formant curve. Also, many of the sus¬ 
tained vowel sounds of speech can be identified from the positions of two or 
more formant frequencies, independent of fundamental frequency. 

On the other hand, a type of tone quality invariance may also exist with 
respect to transposition of a sound spectrum. For example, a brass tone retains 
a certain identity when the pitch is changed but the spectrum "shape" is held 
fixed. A trumpet tone, when recorded and played back at twice the original 
speed (a transformation invariant with respect to spectrum shape), sounds like 
a toy trumpet. At half speed it sounds more like a trombone and at one-quarter 
speed, like a tuba. High tones of the cello can in certain contexts be confused 
with middle register violin tones even though the cello formant characteristic 
is roughly a transposition of the violin characteristic by more than an octave 
lower. (Besides tone quality, these intra-family confusions may be due to the 
existence of other parameters which describe a family of musical instruments 
as a whole, such as peculiar amplitude and frequency changes with respect to 
time). The idea of transposition invariance may have some validity for the 
case of sustained vowel sounds. Although for the same spoken vowel the abso¬ 
lute values of the formant frequencies vary considerably for different people, 
especially when children are included m the analysis, there is reason to believe 

that the ratios of the first and second formant resonances of a sustained vowel 

3 

are sufficiently constant to allow their identification. In some sense, this 


3. Pierce, J.R. and David, E.D.: Man’s World of Sound , Doubleday & Company, 

Garden City, N.Y., 1958, p. 98. 
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part of the formant characteristic for one person may be treated as a frequency 
transposition of that of another person. 

The transient behaviors of the attack portions of the tones of several 

4 

wind and string instruments have been measured by Luce. His data show that a 
great deal of information is required to duplicate these attack transients pre¬ 
cisely. Nevertheless, despite the large amounts of randomness in the harmonic 
intensities and phases which are present in many cases, there appears to be a 
considerable amount of statistical order which can be used to distinguish one 
musical instrument from another. The attacks are certainly not describable by 
total envelope modulation of a basic waveform. For several instruments Luce 
determined that the harmonics reach full intensity at slightly different times, 
the time "delay" between the entrances of partials was measured to be on the 
order of 10 milliseconds. 

LeCaine has reported attempts to simulate vibrato and tremolo effects by 

constant sine wave frequency and amplitude modulation and by random modulation 

5 

at rates varying between 5 and 7 cps. Both of these types of simulation tend 
to yield a monotonous sound. I would suspect that "naturalness" of vibrato can 
be achieved by a control of the rate and amount of modulation with respect to 
time as well as the control of certain statistical parameters which describe 
the random components . Also, the introduction of reverberation seems to "blur" 
monotonous repetitions and increase "naturalness". 


4. Luce, D.A. : "Physical Correlates of Nonpercussive Musical Instrument Tones," 

unpublished Doctoral Dissertation, M.I.T., 1963, pp. 100-224. 

5. LeCaine, H. : "Electronic Music", Proc. Inst. Rad. Eng., 44 (4), 457-478, 

1956. 
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1.4 Synthesis of Harmonic Tones 

In electronic music, two of the most useful methods for creating sounds 
are additive synthesis and subtractive synthesis . In additive synthesis, the 
total sound signal is formed by the superposition of separate constituent parts 
according to the proportions desired. In subtractive synthesis, a signal is 
generated which contains the desired frequency components, and a formant filter 
is used to shape the distribution of these components. More precisely, m the 
additive synthesis of a harmonic tone, sine waves tuned exactly in frequency 
ratios 12-3: . . . are summed together, and the desired tone quality is devel¬ 
oped by independent adjustment of the relative amplitude of each harmonic. A 
more general additive harmonic tone synthesizer would have provision for changing 
the phase of each harmonic and the fundamental frequency independent of the other 
parameters. In subtractive synthesis, an easily generated waveform composed of 
a large number of harmonics, with amplitudes which do not diminish too rapidly 
as the harmonic number increases, is used as the input to a spectrum-shaping 
filter. In order to change the spectrum, the filter transfer response is varied. 
Two other methods commonly used to generate sounds, including harmonic tones, 
are modulation synthesis , whereby two simple signals are combined by amplitude 
or frequency modulation to form a more complex signal, and waveform synthesis , 
in which case the wave shape of the output of a generator can be varied within 
certain restrictions. Under the assumption that spectral distribution is the 
most important factor determining tone quality, these two methods do not seem to 
be very flexible for scientific investigations since the distributions which 
arise from given syntheses are not obviously related to the parameters specified. 

Subtractive synthesis is the most common method of tone generation employed 


m electronic organs and other electronic musical instruments for several reasons 
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First, waveforms such as the square wave, the pulse, and the sawtooth wave are 
rich m harmonics and can easily be generated by electronic circuits. Second, 
fixed filters for shaping a number of differentiated spectrums can be econom¬ 
ically produced. Third, compared to additive synthesis, subtractive synthesis 
has no audible upper limit to the number of frequencies which can be generated; 
therefore, complex or "nasal" tone qualities can be produced as easily as "pure" 
tone qualities with low order harmonic content. Fourth, formant filtering of 
waveforms represents a reasonable analog to the acoustic behavior of ordinary 
musical instruments, including the organ, which is the instrument most builders 
of electronic musical instruments attempt to simulate. 

On the other hand, for scientific investigations additive synthesis allows 
much more control over the harmonic spectrum, particularly with regard to changes 
of this spectrum with respect to time. Also, the variety of steady state dis¬ 
tributions which can be produced by subtractive synthesis is limited by the fre¬ 
quency responses of available filters, and the structure of the transient parts 
of the tone produced is restricted to that which results from total modulation 
of the steady state distribution (or filtered parts of it). These limitations do 
not exist for a system of additive synthesis because the harmonics are created 
separately, controlled separately, and then superimposed. 

1,5 Voltage Control of Determinative Parameters 

Only the values of the relative amplitudes and phases of the harmonics and 
the fundamental frequency convey any information about a harmonic tone. In music 
these values must be continually changing if there is to be any information flow 
with respect to time. Since control by manual operation always involves restric¬ 
tions on speed and accuracy, a more fruitful approach to instantaneous timbre 
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control would be some means for automatic programming of the salient parameters. 
An especially compact medium for the instantaneous representation of a parameter 
is an analog voltage, which can be conveniently transported from the output of 
a "parameter synthesizer" to the input of a harmonic tone synthesizer. Thus a 
voltage-controlled additive harmonic tone synthesizer (which I will refer to 
from now on as a harmonic tone generator ) is a device with 2n + 1 analog volt¬ 
age inputs—corresponding to the fundamental frequency and the amplitudes and 
phases of n harmonics—and a single output which is the sum of the n controlled 
harmonic frequencies. By contrast, a voltage-controlled subtractive synthesizer 
would have one input for control of the fundamental frequency and several others 
to control the shape of a formant filter; the shape can be effectively controlled 
by altering the intensities of the outputs of several fixed band pass filters, or 
more flexibly by controlling the bandwidths, center frequencies, and intensities 
of several voltage-variable filters. Hence, in a voltage-controlled parameter 

i 

system, the information-bearing parameters, which are physically separated from 
the main synthesizer system, can be generated by whatever means is available and 
according to whatever designs are desired. The only limitations on the specifi¬ 
cation of a parameter as a function of time are those imposed by the control 
response characteristics of the synthesizer and the capability of the parameter 
voltage generators to produce outputs corresponding to arbitrary functions of 
time. 

Circuitry for generating control parameters is discussed in Chapter 3. It 
should be noted that certain classes of attack and decay shapes are easy to gen¬ 
erate electronically, and that by using these shapes, the effect of independent 
control of the harmonic envelopes can be conveniently investigated. Also, sound 
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effects such as vibrato, trill, "buzz", and "clangorous sound" can be produced 
readily by slow and fast sine and square wave frequency modulations. 

1.6 Some Applications of Harmonic Tone Synthesis 

The input voltages to a harmonic tone synthesizer can be regulated in a 
number of ways, including direct human operation. For instance, a keyboard can 
be wired with a set of resistors and a voltage reference in such a way that for 
each key depressed a voltage is generated which, when applied to the frequency 
control input, determines a pitch corresponding to the key. Similarly, a con¬ 
ducting wire can be arranged to make contact with a biased wire wound resistor 
so as to produce both discrete and continuous pitch-voltage changes according 
to the position of the hand. It would be very difficult for one person to con¬ 
trol simultaneously all the harmonic intensities without some aspects of the 
control being pre-set. One method is to have the voltage generated by the key¬ 
board or conducting wire actuate a set of envelope synthesizdrs with attack and 
decay times controlled by means of several stop positions . Another method is 
touch control, where the pressure exerted on the keyboard produces a proportional 
voltage for control of the intensities. Other effects, such as vibrato, can be 
varied by manual manipulation of the frequency and amplitude controls of a sine 
generator. In this form, the voltage-controlled harmonic synthesizer becomes a 
very flexible monophonic musical instrument, capable of generating an unusual 
variety of timbres at the disposal of the musician performing it. 

i 

For electronic music, contours of pitch and timbre ought to be more rigidly 
controlled with respect to time. These contours, m the form of time-varying 
analog voltages, can be derived from some sort of "synthesizer" of musical param¬ 
eters, analogous to the paper roll used m a player piano. When a large number of 
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parameters is being generated simultaneously to control a number of single 
sounds , or, for that matter, more complex homophomc effects, it is of utmost 
importance that time synchronization be maintained. This condition is satisfied 
for either an analog parallel tape system such as a clear celluloid with sev¬ 
eral distinct channels which are darkened proportionally to each parameter 
intensity or by means of digital-to-analog converters in conjunction with a 
coded system such as punched paper tape, digital magnetic tape, or direct dig¬ 
ital computer output. With this type of synthesizer system m operation, made 
feasible by the voltage-controlled circuitry, the conventional time-consuming 
method of adjoining musical events by tape splicing would be less frequently 
required, 

As mentioned in Section 1.1 harmonic synthesizers have useful application 

\ 

for studies in sound perception. Data derived from a thorough investigation of 
the harmonic tone may shed light on certain properties of the reaction of the 
ear which have not been heretofore tested, particularly the response of the ear 
to complex transient signals. Also, through experience in using these devices 
m electronic music we cannot help but gam insight into the problem of predicting 
the specification of a sound when given its imagined subjective effect (e.g., m 
the mind of the composer). 

In general, musicians need to acquire an education about the quantifying 
of timbre comparable to their traditional education about pitch organization. 

Since their contact with a subject is most direct when sound and physical rep¬ 
resentations and not mathematics are involved, a harmonic tone synthesizer in 
conjunction with a good sound playback system and, perhaps a visual spectrum 
analyzer, would constitute a good educational tool for an explanation of timbre 
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in quantitative terms. People who work in scientific areas may also be interested 
to know the relation between a mathematical Fourier series and the waveform and 


sound it produces 
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Chapter 2 

DESIGN OF A HARMONIC TONE GENERATOR 


2.1 Introduction 

As discussed in the previous chapter, a harmonic tone generator, which 
produces a harmonic series of frequencies with fundamental frequency, harmonic 
amplitudes, and harmonic phases independently adjustable, is essential for the 
further development of electronic music and science-oriented music research. 
However, before describing a practical realization of this instrument, it is 
appropriate to present in detail some of the basic design decisions and factors 
which restrict the design possibilities. To the designer of a harmonic tone 
generator the following questions must be posed: (1) What is a convenient and 
flexible form of the operation of this device? (2) What are some possible al¬ 
ternatives for its realization? (3) What requirements on tolerances and ranges 
of the defining parameters are necessary because of musical and perceptual 
considerations? 

2.2 Operational Form 

For the production of musical sound the operation of an additive synthesizer 
is m a very convenient form if the parameters which define the output correspond 
to the important musical parameters: pitch (frequency of the fundamental) and 
tone quality (distribution of the amplitudes and, to a much lesser extent, the 
phases of the harmonics). It should also be possible to vary these parameters 
independently of one another. Furthermore, m order to achieve a most flexible 
and instantaneous means for controlling these parameters, it is advantageous 
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that they be remotely and independently programmable by external analog DC . 
voltages rather than by means of manual or servo control of variable physical 
components. Hence, this is a device with several inputs and a single output. 

A representation of its output voltage can therefore be written explicity in 
terms of its input voltages: 

n 

V out = S a k (t)cos(27T k/f(t)dt + 0 k (t)) (2-1) 

k=l J 

a k *t) = c k V a (t), <£ k (t) = pV, (t>, f(t> = q(V f (t>) 
a k Tc 

a is the instantaneous amplitude of the kth harmonic, 

K 

V is the kth harmonic amplitude control voltage, and 
& 

c is the constant relating the two (set by an attenuator) . 

K 

<p is the instantaneous relative phase shift of the kth harmonic, 

K 

Vi is the kth harmonic phase control voltage, 

p is a constant which relates <f> and Vj. . 

f is the instantaneous fundamental frequency, 

V f is the fundamental frequency <i.e., pitch) control voltage, 
q is a function which relates f and V f . 
n is the number of harmonics which are generated. 

Since any voltage source, such as a sine generator, can be used as a fundamental 
frequency, harmonic amplitude, or phase input, the parameters which define the 
output normally vary with time „ 

2.3 Three Possible Method^ for Additive Generation of Harmonic Tones 

The first method given below was adopted for the construction of a prac¬ 
tical harmonic tone generator, which is described in Section 2.5. However, 
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two other methods were considered. These will be described more briefly. 

It should be noted that none of these methods utilize physical moving 

parts. They are completely electronic. Harmonic generators have been built 

0 

using AC generators driven by harmonically geared motors. Also, a few musical 

instruments have employed electro-mechanical additive synthesis techniques 

For example, tones produced by the Hammond Organ are formed by the superposition 

7 

of six individually controlled partials. These partials are not tuned to 
strict harmonic ratios, however, since the frequencies are borrowed from the 
equal-tempered scale. A particularly elaborate harmonic tone synthesizer was 

g 

constructed by Bell Telephone Laboratories in 1940. The waveforms of 100 har¬ 
monic sine waves were recorded on separate channels of a rotating magnetic 
drum. These sine waves were then read from the drum and filtered to increase 
their purity. Their individual amplitudes were regulated by means of a set of 
100 linear attenuators mounted on a single panel. The fundamental frequency of 
the harmonic tone generated could be altered by changing the drum's speed of 
rotation. 

a ■ Hetrodvne Method 

The generation of harmonics by the hetrodyne method involves the use of 

two ultrasonic carrier frequencies: f . which is fixed, and f , which is 

c u 

variable, f is variable between the values f and f + f , where f is 
u c c max max 

6 . Miessner, B.F.: "Electronic Music and Instruments," Proc, Inst. Rad. Eng ., 
24(11), 1427-1463, 1936. 

7. Dorf, R.H.: Electronic Musical Instruments , Radio Magazines, Inc., Mineola, 
N.Y., 1958, pp. 25-45. 

8 . Fletcher, H.: "The Pitch, Loudness and Quality of Musical Tones," Bell 
Telephone System Monograph No. 1397, 1946. 
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an audio frequency. The audio beat tone which results from mixing f c and f is 
f = f - f . f is the fundamental frequency of this generation scheme, and 

£t U C 3. ^ 

it varies between 0 and f . If the kth harmonics of f and f , namely kf 

max u c’ u 

and kf . are generated and mixed, the frequency of the resulting beat tone 
c 

is kf - kf = k(f - f ) = kf . which varies between 0 and kf 

u c u c a max 

This gives a method for deriving a series of audio harmonics from the 

corresponding harmonic series of two ultrasonic frequencies. The attendant 

advantage is that the ultrasonic harmonics can be separated by means of filters 

over a reasonably wide frequency range. There are several methods of performing 

this separation, but the logically least complicated way is to t iparate the 

harmonics generated by a special waveform such as a pulse wave. Since f^ is a 

fixed frequency, its harmonics can be easily separated by narrow band filters. 

However, there are restrictions on the response characteristics of the band 

pass filters used to separate the harmonics of which depend on the values 

of f , f , and n, the number of harmonics to be generated. Even if perfect 

filters with rectangular response characteristics were available, the relation, 

n(f + f ) < (n + l)f (2-2) 

c max c , 

must be satisfied to allow the separation of the nth harmonic from the (n+l)st 
harmonic of f over its entire frequency range. Given that n harmonics are to 
be separated over the entire frequency range of the audio fundamental, Equation 
(2-2) can be solved to yield the following restriction on the value of the 
"carrier frequency" f : 

, Bf <2y3) 

f ^ nf 
c > max. 

In actual practice, the use of filters of reasonably low complexity dictates 

that f be at least twice nf 

c max 
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If kf is the drive frequency input to a ’’chopper-type" phase detector 

and the detected signal is a sine wave of frequency kf^, there results a pure 

audio frequency output, (V Q / 7 r)cos (k 27 ff + <p ^), where V a and <p^ are the 

k k 

amplitude and phase of the fixed frequency kf^, respectively. The phase of 
kf c can be controlled by means of a phase modulator circuit, and the amplitude 
can be controlled by means of a chopper circuit which samples the amplitude 
control voltage for one-half cycle at the rate of kf^ m order to produce a 
variable-amplitude square wave. This is converted to a sine wave by a tuned 
circuit immediately preceding the phase detector. The entire process for 
generating one audio harmonic with voltage-controlled amplitude and phase is 
depicted in Figure 2.1. 

Pure audio harmonics can also be produced if the phase detector is 
driven at the rate f ( instead of kf >, and care is taken to insure that the 
detected signal is free from other frequencies harmonically related to f^. 

When this method is used, the need for band pass filters to separate the har¬ 
monics of is eliminated. The output of the phase detecton for each harmonic 


is maximized if the samples of the-detected signal, which occur every 1/f 


u 


seconds, are of optimum duuation a,/f . Its precise form can be determined 
5 k u 

through analog multiplication of the signal V a ^cos(k2fl-f^t + (p^) and the Fourier 
series representation of the appropriate pulse tram: 


V pd = V cos(k 27 rf c t + <^) 


pd 

out 


+ ~ S — - ( j—— cos[j 277 (f + f )t] 
IT J= i J c a 


(2-4) 


2? a u 


sm(kffa k ) 


cos(k 27 /f a - (p^) + high frequency terms. 





7T 


V Q (t)c 0 S(K 27 Tf 0 t-* K 


♦ HIGH FREQUENCY TERMS 


* 



W’ V*> 


(t)) 


Figure 2.1 Block Diagram for Generator of Audio Harmonic 
with Voltage-Controlled Amplitude and Phase. 
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The amplitude of the audio component of the output is maximized when the ratio 
of the sample duration to the interval of repetition is given by a = m/2k, 

where m is the largest odd integer not exceeding k. 

If it is not required that the audio harmonics be generated independently, 
i. e., if an output consisting of the sum of the modulated harmonics is suffi¬ 
cient, there results a considerable savings m circuitry. In this case, the 
harmonics of f are formed, phase and amplitude modulated, and summed prior 
to detection by a single phase detector which is driven at the frequency f 
The audio output is; 

Hi 2 Va k co S <k2fff a t - 4> (2-5) 


where a/f u is the phase detector sample time chosen to produce suitable ampli¬ 
tudes for all of the n harmonics. The harmonic amplitudes corresponding to a 
standard control voltage level should be equalized by a pre-emphasis of the 
weaker harmonics of f before summation. The V n . 's must be so scaled that the 
composite signal does not exceed the peak value upon which the phase detector 
circuit can linearly operate. Since a is not optimum for each harmonic, ,and 

each V„ is reduced, the resulting amplitudes of the audio harmonics are con- 
a k ' 

siderably less for a system with a single phase detector than for a system 
utilizing individual phase detectors for each harmonic (with equivalent cir¬ 
cuitry) . 

In any case, the variable ultrasonic frequency f is derived from the 
output of a voltage-controlled oscillator, so that the fundamental audio 


frequency is a function of a pitch control voltage. 
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The chief advantages of the hetrodyne method are: (1) The harmonics are 
automatically synchronized over the entire range of the fundamental; and (2) 
each of the important parameters, the harmonic amplitudes and phases and the 
fundamental frequency, is voltage-controlled. A disadvantage of this method is 
that the circuitry is quite complex, requiring separate circuitry for each 
harmonic. Also a practical disadvantage is the difficulty m realizing an 
ultrasonic v.c.o. with sufficient frequency stability. However, this can be 
improved if f is derived, by single-side-band operation, from the output of a 
more stable lower frequency v.c.o. In this case, care must be taken to prevent 
harmonics of the lower frequency from coupling into the ultrasonic system and 
by hetrodyne action producing inharmonic image frequencies in the audio output. 

b. Multiplier-Subtracter Method 

A second method for generating harmonics which was also given some consid¬ 
eration involves a multiplier-subtracter process. This method for producing 
independent harmonics does not employ frequency sensitive elements and has no 
theoretical frequency range limitations. It utilizes the trigonometric identity, 

cos(jwt)cos(ktot) = £cos(j-k)ut + £cos(j+k)wt, (2-6) 

which can be abbreviated by 

jxk-3»(j-k) + (j+k). 

The extraction of five harmonics from a fundamental sine tone is then accom¬ 
plished by the following procedure: 

lxl-> 0 + 2 - * 2, 

1x2 - 1-» 1+3-1 ->3, 
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2x2 —> 0 + 4 -4, 

2x3 -1—1 + 5-1 —5 } or 1x4 - 3 3 + 5 - 3 5. (2-8) 

It is clear, therefore, that any number of harmonics can be separated by 
means of a system consisting solely of multiplier and subtracter circuits. An 
audio voltage-controlled oscillator is the source of a fundamental sine tone 
and other multipliers are used to control the individual harmonic amplitudes. 

! 

The even harmonics can be derived by squaring circuits so that for multipliers j 

I 

I 

operating by means of the "quarter-square" principle (each multiplier containing 

two squarers), the equivalent of one multiplier can be used to derive two even j 

t 

harmonics. % 

There are, unfortunately, several obvious drawbacks to this type of har¬ 
monic generator: (1) Multipliers of sufficient accuracy and wide frequency 
range are very expensive. (2) Cancellation of harmonics by subtracting signals j 

is subject to phase and amplitude errors. (3) Errors in producing higher har- I 

monies are bound to accumulate and exceed tolerable limits. (4) The relative 

i 

phase shifts of the harmonics cannot be easily controlled. 

c. Voltage-Controlled Oscillator Method 

Finally, as a third possible process a set of identical sine wave v.c.o.'s 

with either linear or exponential frequency-versus-voltage characteristics 

„ . 1 

might be arranged to track a master control voltage so that the individual 

output frequencies would be at fixed ratios to one another. In particular, 

the ratios can be harmonic. Suppose that the response curves are linear, so 

that for each oscillator f = cV^. Suppose further that each of the v.c.o.'s 

is preceded by an attenuator. Then, if a master voltage V is applied in parallel 

to the inputs of these attenuators and they are set so that the input voltages I 


1 
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to n v.c.o.*s are V /n, 2V /n, 3V /n. .... (n-l)V /n, V . consecutively. 

m * m * m ’ * m * m’ J ’ 

\ 

the output frequencies are in the relation 1:2:3: . . . : n-l:n, respectively. 

Similarly, suppose that the response curves are exponential, so that f ^ = 

k^ e k 2 v m £ or eac h oscillator and that the v.c.o. *s are provided with voltage 

summers at their inputs. Then, if V is applied in parallel to the summer 

m 

inputs and summed consecutively with the voltages 0, ln(2)A , ln(3)A . . . , 

2 2 

ln(n)/k the v.c.o output frequencies are again in harmonic relation. 

Ct 

Three advantages of either of these systems are: (1) All sub-units are 
identical; (2) nonlinear networks for extracting harmonics are not required, 
and (3) the oscillators may be used for applications other than harmonic 
synthesis (for generating inharmonic tones or separate tones). On the other 
hand, the requirements with regard to frequency stability and frequency track¬ 
ing over a wide range are very stringent. A 40 to 20,000 cps frequency range 
is useful, but very small frequency errors (2.5 cps or .2 peicent, whichever is 
greater) are perceptible Moreover, the oscillators are not synchronized 
(although synchronization apparatus could be introduced for applications which 
require a limited frequency range), and so the relative phases of the harmonics 

are continually changing with respect to one another According to more than 

. 10 , 11,12 

one researcher, phase changes in the higher numbered harmonics do 

9 Shower, E.G and Biddulph, R : "Differential Pitch Sensitivity of the Ear.” 
J. Acoust Soc. Am , 3(2) , 275, 1931. 

10. Mathes, R.C. and Miller, R.L. : "Phase Effects in Monaural Perception," 

J. Acoust. Soc Am , 19(5) , 780-797, 1947. 

11. Licklider, J C.: "Effects of Changes in the Phase Pattern Upon the Sound 
of a 16-Harmonic Tone," J, Acoust. Soc. Am. . 29(6) . 780 (abstract), 1957 

12 Schroeder, M R.: "New Results Concerning Monaural Phase Sensitivity," 

J Acoust Soc. Am , 31 (11) , 1579(abstract), 1959. 
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affect the perception of tone quality, and it follows that the timbre of a tone 
with non-synchronized harmonic partials is not necessarily a fixed quantity 
with respect to time. 

If the v.c.o.'s are "beat-frequency" oscillators, there is a particularly 
convenient method for amplitude modulating the individual harmonics (as described 
in Section 2.3a), otherwise, the amplitudes of the individual harmonics must be 
controlled at the outputs of the v.c.o.'s which generate them. 

2.4 Some Design Considerations 

A precise definition of the transfer characteristics of a harmonic tone 
generator, according to Equation (2-1), requires that the specifications be 
given for the values of n, p, and q, for the range of values for f(t), V f (t), 
the V (t), c , and 0 (t), and for the tolerances permitted these values as 

cl, K K 

k 

well as other criteria relating to the "closeness" of Equation (2-1) as a 
description of the measured output. 

a. Number of Harmonics 

The question arises as to how many harmonics are needed to create aurally 
significant tones and how many harmonics should be included m an additive 
synthesis system before a point of "audible diminishing returns" is reached. 

There are some data available on conventional musical instrument tones which 
may help answer this question. The second column from the left in Table I. 

13 

gives the lowest fundamental frequency normally played on each instrument, 

the next column gives the frequency at which the formant response of the mstru- 

2 

ment is 30 db below the peak resonant value, the last column on the right gives 
13. Culver, C.A. . Musical Acoustics, McGraw-Hill, New York, 1956, p. 95. 
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Instrument 

f x (cps) 

f 2 (cps) 

V 

Flute 

261 

3000 

11.5 

Oboe 

233 

8000 

34,3 

English Horn 

165 

4000 

24.3 

Bassoon 

58 

2000 

34.5 

Trumpet 

165 

3000 

18.2 

Trombone 

82 

2000 

24.4 

Tuba 

41 

1200 

29.3 

French Horn 

62 

800 

12.9 

Violin 

196 

7000 

35.7 

Viola 

131 

3000 

22.9 

Cello 

65 

800 

12.3 

String Bass 

41 

1500 

36.6 


Table I 


Estimate of Number of Harmonics Required to Simulate the Tones of Several 
Musical Instruments Over Their Complete Ranges (n). f^ = lowest fundamental 
frequency, = highest frequency at which formant response is less than 30 db 

below peak formant response. 
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the quotient of these two values, i.e., an estimated measure of the maximum 

number of harmonics required to simulate each instrument's steady-state tone. 

Note that according to this estimate, the flute, cello, and French horn can 

be simulted over their complete ranges with twelve or thirteen harmonics 

14 

whereas many of the other instruments require over thirty harmonics. These 
figures may be unduly pessimistic due to masking factors, and subjective 
experience with additive synthesis systems may be the only means of settling 
this question. I feel that twelve harmonics are probably sufficient for 
initial investigations into the perceptive properties of harmonic tones. 
Although the six harmonics which are now producible on the Harmonic Tone 
Generator model described m Section 2.5 are inadequate for creating a 
complete range of steady-state timbres, this generator is still interesting 
to many composers of electronic music because of its unusual flexibility for 
creating changes in the frequency and amplitude parameters which conventional 
instruments cannot simulate. 
b. Control of the Fundamental Frequency 

Four factors to consider when specifying the control of fundamental 
frequency are: frequency range, frequency stability, the frequency-vs.-voltage 
control curve, and the time lag between frequency changes and discrete control 
voltage changes 

For any practical voltage-controlled oscillator which is to be the source 
of a variable frequency, instability generally increases with increasing 


14. V.0. Knudsen proposed m 1936 that a generator of 40 harmonics be produced 
See his "Modern Acoustics and Culture," University of California Press, 
Berkeley, California, 1937, pp. 22-31. 
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frequency range; hence, the frequency range must be limited. It is evident 

that taste and perceptual limits have influenced the choice of the upper pitch 

limits for ordinary musical instruments. Of the conventional instruments, only 

the piano and pipe organ generate fundamental frequencies above 4000 cps. , and 

15 

only the violin, harp, flute, piccolo, and xylophone exceed 2000 cps. Also, 

listening tests indicate that the perception of musical pitch falls off rapidly 
16 

above 4000 cps„ If the maximum fundamental (first harmonic) frequency output 
producible by a harmonic synthesizer is chosen to be f ^ = 2000 cps,, a 4000 
cps, range is easily accommodated through setting the odd harmonics to zero 
amplitude. 

As mentioned above, pitch discrimination tests made under»optimum listening 

conditions indicate that a suitable criterion for frequency stability is: 2.5 

g 

cps, or ,2 per cent whichever is greater. This should be maintained over a 

range of room temperatures likely to be encountered in a temperate climate, 

i.e„, 60 to 90°F. The maximum short-term frequency deviation which is tolerable 

should also be specified, and in this case tape recorder flutter data are quite 

applicable. An acceptable criterion for frequency “jitter'’ seems to be: ,3 per 

17 

cent r,m,s, integrated over components between 1 and 200 cps. 


15„ Olson, H.F, : Acoustical Engineerin g, D. Van Nostrand Co., New York, 1957, 
P. 610. 

16, Bachem A. : "Chroma Fixation at the Ends of the Musical Scale," J, Acou st„ 
Soc, Am, , 20(5), 704-705, 1958, 

17. Haynes, N.M. : Elements of Magnetic Tape Recording , Prentice-Hall, 
Englewood Cliffs, N.J., 1957, pp. 183-193, 
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18 

pitches has a great deal of validity for musicians, despite the fact that 

the number of just-detectible frequency changes per octave increase signifi- 

19 

cantly from the low to high frequency ranges. Our Western equal-tempered 
scale is, of course, strictly logarithmic. Therefore, an anti-log or expo¬ 
nential curve describing the frequency as a function of control voltage would 
be a logical type to specify. 

Eirom the standpoint of music system design, both the exponential and 
linear functions are convenient because of the attendant possibilities for 
frequency transposition. In the case of exponential control, any sequence of 
voltages, converting to a pattern of pitches, can be arithmetically added to 
a fixed-voltage in order to produce a composite pitch control voltage. This 
in turn, produces an interval transfbrmation of the original pitch pattern 
proportional to the "augmenting" voltage. Similarly, if this "augmenting" volt¬ 
age, which is added to the "main" pitch control signal, alternates in such a 
way as to produce a bibrato or trill with a certain frequency deviation with 
respect to a given "main" frequency, e.g., a semi-tone, this frequency interval 
deviation will remain fixed for any "main" frequency. The electronic design 
is simplified if the pitch control voltage has a single polarity. Therefore, 
a satisfactory frequency control function would be exponential from 2093 cps. 
(C^) to 32.7 cps.CC^) (about the lowest useful fundamental) and trace some 

18. Ward, W.D. : "Subjective Musical Pitch," J. Acoust> Soc. Am ., 26(3), 
370-380, 1954. 

19. Lewis , D. : University of Iowa Studies in the Psychology of Music , Vol. 

IV, 1937, University of Iowa Publication, Iowa City, p. 346. 
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other curve to the point zero volts, zero frequency. In this way, the gener¬ 
ator output can be "turned off" by setting the pitch control voltage to zero, 
independent of the amplitude control. 

In the case of linear frequency control, transposition of pitch sequences 
can be implemented by means of attenuation or amplification of the control volt¬ 
age, rather than by addition of a fixed voltage. However, the superposition of 
a constant interval trill or vibrato cannot be implemented by a fixed atten¬ 
uator. The principal advantage of linear control lies m its relative insen¬ 
sitivity to control voltage inaccuracy 

When a sequence of discrete pitch control voltages is applied to the 
harmonic tone generator frequency input, one should not be able to distinguish 
the presence of a frequency glide between the resulting pitches. The minimum 
time which gives this impression when a pitch changes from one discrete value 
to another has been informally measured m our studio to be between 10 and 20 
milliseconds. 

c. Harmonic Amplitude Control 

For simplicity in system planning and in the electronics design, a 
linear relation between the harmonic amplitudes and their control voltages seems 
most logical. Although a case can be made for exponential control, since the 
perception of intensity is close to being a logarithmic response, the use¬ 
fulness of linear control m producing easily predictable sideband modulations 
from alternating external voltages should be noted. The maximum modulation rate 
should be sufficient to allow the production of sharp attacks. The shortest 
rise time which can be discriminated from the immediate turn-on of a tone is 
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20 

about 1.5 milliseconds. Therefore, a 1 kc modulation bandwidth is sufficient. 

The linearity of output intensity of a single harmonic with respect to its 
control voltage is not critical since no one envelope function takes a priori 
precedence over another. Nevertheless, for good correlation between the external 
envelope control and the aural perception of an attack envelope resulting from 
this control, the linearity should be within 1 per cent full scale. A conveni¬ 
ent full scale voltage is 10 volts. 

Two other factors which degrade amplitude control are (a) any finite amount 

of a harmonic output when its control voltage is zero (zero leak); (b) any 

amount of the control signal which appears at the output (control leak). The 

zero leak should be sufficiently small compared to the maximum harmonic output 

as to be inaudible. This requirement for inaudibility depends on the harmonic 

frequency, normal maximum listening level, and environmental background noise, 

21 

The Fletcher-Munson curves demonstrate that for a 3000 cps, sine tone heard at 
a maximum 90 phons in a noise free chamber, the zero leak must be about 90 db down 
from this level to be inaudible to acute ears, whereas at 50 cps., only a 35 db 
difference is needed. These curves were derived under ideal conditions, however, 
and in a room with reasonable background noise, such as the University of 
Illinois Experimental Music Studio, a 70 db reduction seems to be adequate at 

20. Mathews, M.V, : "The Digital Computer as a Musical Instrument," Scienc e, 
142(3592) , 553-557, 1963, 

21, Fletcher, H. and Munson, W.A. : 'Loudness, Its Definition, Measurement 
and Calculation," J. Acoust, So c , A m, , 5(2), 82-108, 1933, 
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any frequency. The amount of amplitude control signal which appears at the out¬ 
put for any modulation frequency should be balanced out to at least 40 db below 
the output level. One reason for this balance requirement is that during the 
transient period a sharp attack envelope voltage appearing at the output con¬ 
stitutes a very wide spectrum and will be perceived as an undesirable "click". 
The click should be low enough in intensity to be masked out by the sound 
itself. 

d. Harmonic Relative Phase Control 

Statements about the dependence of tone quality on static relative phases 
are still very speculative, although it now seems certain that there is some 
effect .' l0,11,12 For at least two reasons, it is believed that phase distrib¬ 
utions have a secondary effect compared to frequency distributions!. First, the 
sounds of electronic sources and of the steady-state tones of musical instru¬ 
ments are not appreciably altered by playing these sounds through high quality 
reproducing systems (including loudspeakers) which have widely different 
phase shift characteristics. Second, standing-wave distributions in small 
rooms introduce large phase shift variations throughout with the result that 
the perceived phase distribution of a steady signal should depend on the 
listener's position in the room and the position of his head. However, it is 
not common experience for steady state timbres to change significantly as one 
walks about a small room. In order that this problem may be investigated in 

greater detail each harmonic should have provision for changing its phase a 
o 

full 360 as a function of its phase control voltage. On the other hand, it 
is felt that the installation of the fundamental frequency and harmonic ampli¬ 
tude controls is much more important for music applications and should be 
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completed first. 

The effects of sudden phase changes during the attack transient and slow 
random variations during the steady state are even more uncertain. It is 
believed that periodic phase variations exceeding 100 cps are not required 
for this investigation because these variations are perceived in terms of 
side-band spectral components and destroy the effect of a harmonic tone. 

e . Purity of the Output Signal 

Besides the requirement that the various parameters contained in Equation 
(2-1) be represented within a suitable tolerance, there is the requirement 
that the level of added unwanted frequency components be below a certain toler¬ 
able level. These components can be categorized into three types: (1) harmonic 
distortion of the individual harmonics; (2) discrete inharmonic components (the 
relative intensity and frequency of these may well depend on the fundamental 
frequency and the particular harmonic analyzed); (3) additive white noise. A 
harmonic distortion level of up to 2 per cent (mostly second and third har¬ 
monic distortion) is usually not noticeable since ssubjective harmonics are 

22 

usually generated within the ear anyway. But inharmonic tones, even when 
reduced to an intensity of 40 db or more below the harmonic output can be 
quite disturbingly evident, particularly when these components are far removed 
in frequency from the harmonics and hence are not sufficiently masked by them. 
(A possible source of inharmonic components is noted in Section 2.3a). A suit¬ 
able requirement is that the inharmonic frequencies be 70 db below the harmonic 


22 .Hunter, J.L. : Acoustics , Prentice-Hall, Englewood Cliffs, N.J., 1957, 
pp. 253-256. 
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output at all levels and that the additive noise be 70 db below full harmonic 
output. 

2.5 Description of Circuitry for A Practical Additive Sythesis Device, the 
Harmonic Tone Generator 

A solid-state version of the Harmonic Tone Generator, which generates 
six independently controlled harmonics, was designed and constructed at the 
University of Illinois Experimental Music Studio according to the "hetrodyne 
method" described above. The fundamental frequency, the harmonic amplitudes, 
and the phase shift of the second harmonic are voltage-controlled parameters. 

A complete but simplified block diagram illustrating the electronic organi¬ 
zation of this device is given in Figure 2.2. 

Figure 2.3 shows the schematic of the "frequency synthesizer" sub-unit 
which generates a pair of ultrasonic frequencies: on one channel a fixed 50 kc 
"carrier" tone and on the other channel a tone whose frequency varies from 50 
to 52 kc as a function of the pitch control voltage. This variable frequency 
is derived from the output of a voltage-controlled oscillator, the frequency 
of which varies from 7 to 5 kc as the pitch control voltage changes from 0 to 
-11 volts. The v.c.o. output is mixed nonlinearly with a fixed 57 kc reference 
tone, and the resulting 50 to 52 kc difference frequency is filtered out. In 
this way, a relatively high, voltage-controlled frequency is produced with 
the stability inherent in a lower frequency oscillator. 

The circuit diagram for this voltage-controlled oscillator is included in 
Figure 2.3. The three varicap diodes shown in the circuit act together as a 
pure capacitance, the value of which varies with the applied voltage, in this 
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case 11,5 - volts, 


according to a regularly defined function 


23 


C = 3K(11.5 - V„ -f V )" n , 
to 7 


(2-9) 


where K, V , and n are parameters peculiar to a single varicap diode. 

In addition to this capacitance, the frequency of the oscillator is a function 

of the inductance and the capacitance of the coil and the transistor input 

circuit. C,„ It is given by the well known formula, f 
d 

Therefore, the fundamental frequency of the Harmonic lone Generator output is 
given by 


vco 


27T 


njh(C+c d )' 


f ^ 7000 - f = 7000 - 
a vco 


1 

2 ?rJUcTcp 


( 2 - 10 ) 


- 7000 - - ==r=a=rai=Brr:'r--=T — 

27rVL(3Kai 5-v. v ) -n J. C J ) 
f o d 

An empirically derived plot of f versus the pitch control voltage V is shown 

cl I 

m Figure 2.4. It happens that for control voltages between 0 and -4 volts 
the curve is quite linear, i.e , 1 100 |v^|, whereas between -5 and -11 volts, 

the frequency is, to a good approximation, an exponential function of voltage, 
i.e., f i 180.10'This curve could be corrected by means of a diode 
function generator circuit, to achieve strictly linear or exponential control. 

The odd harmonics of the fixed and variable ultrasonic frequencies gen¬ 
erated by the "frequency synthesizer” are extracted from square wave distortions 
of these tones. A symmetrical square wave is obtained by means of a limiter 
circuit, shown m Figure 2.5, which utilizes matched silicon diodes. Hence, 
filtering of the third and fifth harmonics is simplified since the output 
waveform is quite free of even harmonics. Nevertheless, because the relative 


23. Weldon, L.A. and Rapski, R„L. : Boost for Electronic Tunning: Part 1," 

Electronics , Vol. 37, April 6, 1964, pp„ 49-53. 
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Figure 2.4. Fundamental Output Frequency vs. Input 

Frequency Control Voltage, Harmonic Tone 
Generator 
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levels of the first, third, fifth, and seventh harmonics, are 0, -10, -14 
and -17, respectively, considerable filtering is required to reject unwanted 
components to a level 60 db below the desired components. By comparison, it 
is easy to extract the even harmonics. A full-wave rectifier circuit, shown 
in Figure 2.6, is used as a frequency doubler, m conjunction with a band pass 
filter. The second harmonic, which is the component desired, and the fourth 
harmonic, which is to be rejected, have relative levels of 0 and -14 db, 
respectively, and without adjustment, the undesired odd harmonics are at a 
level of -30 db or lower. 

The harmonics of the fixed "carrier" tone, once separated, are (in 
general) next subjected to phase and amplitude modulation before they are 
mixed with the corresponding harmonics of the variable tone to form audio 
harmonics. 

In the construction of the Harmonic Tone Generator, first priority was 
given to the installation of the harmonic amplitude and fundamental frequency 
control functions Because the harmonic phases have a much smaller influence 
on timbre, the installation of phase controls was given lower priority. 
Consequently, at the present time phase control circuitry has only been completed 
for the second harmonic. However, phase controls for the other harmonics may be 
added at a later time. 

Figure 2.7 gives the schematic for a phase modulator circuit which is 
used for the second harmonic. The complex voltage gain of this circuit is 

V QUt - R - j (uL - 1/uC) _ e J 0 (2-11) 

V R + j (cdL - 1 /ojC) 

in J 

The amplitude of the gam is unity, independent of any of the variables, 

whereas the phase is given by , -lfeJL- 1/uC 

d> = -2tan --- 

L B 


( 2 - 12 ) 








Figure 2.7 



Modulator Schematic 
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Since C is the capacitance of a set of varicap diodes and is a function of the 
applied voltage, the phase, in turn, is a function of this single-polarity 
phase-control voltage. The phase shift varies over 180° as the control voltage 
changes from 0 to -16 volts. 

Linear amplitude modulation of each harmonic of the fixed 50 kc "carrier" 

tone by a control voltage of either polarity is achieved with a "chopper" circuit, 

which is also useful as a phase detector and as a balanced modulator (used in 

the frequency synthesizer sub-unit). The diagram for this circuit, which uses a 

wide-band transformer and a 3N68 integrated chopper, is shown in Figure 2.8 

along with its equivalent circuit, a single switch which opens and closes at 

the rate of the input "carrier" frequency. The output of the chopper alternates 

between the input voltage V & (t) and zero volts at this rate. It follows that 

the signal produced is identical to that which would result if V (t) were 

a k 

multiplied in analog fashion by a square wave alternating between zero and one. 

By using the Fourier series representation of this square wave, the output of 
the chopper can be expressed in terms of its frequency components: 



V (t) 5 + - 2 (-D 1 c°s[ (2i+l)(k27rfrt +&)] 

a k 2 71 i=0 2i+l 


(2-13) 


A tuned circuit which selects the component, (2/7 t)V (t)cos(k27rf t + 0 ), 

a k c k 

prior to phase detection, completes the amplitude modulator. The tunedcircuit Q 
should be large enough for sufficient rejection of other harmonic components 
but not so large that a 2 kc amplitude modulation bandwidth cannot be accommodated. 
When this amplitude-modulated sine wave is applied to the same chopper circuit 



AMPLITUDE CONTROL VOLTAGE 

(LOW IMPEDANCE DRIVE) 



Figure 2.8 Chopper Circuit. 
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2 

= (2/7T )V & (t)cos(k27Tf t - <^) high frequency terms, (2-14) 

k 

The first term of the last expression represents, then, the only audible com¬ 
ponent—the kth harmonic with voltage-controlled amplitude and, m the case 
of the second harmonic, the phase. 

The entire process-frequency multiplication, phase modulation, amplitude 
modulation, and phase detection—is summarized as a block diagram m Figure 2,1, 
The relative amount of each phase detector output is set by a manual 
attenuator before summation of all six harmonic signals is accomplished. A 
sharp cutoff low pass filter is used to remove the ultrasonic frequencies from 
the signal. Therefore, the complex audio waveform corresponding to the attenua¬ 
tor and control voltage settings can be observed on an oscilloscope screen. 

The pitch and harmonic amplitude control voltages are actually composite' 
voltages, that is to say, each control voltage is the sum of several individual 
control voltages. For each of the parameters, one of these voltages is adjusted 
by a manual potentiometer mounted on the front panel. Therefore, the Harmonic 
Tone Generator can be used as a manually operated generator, as well as a con¬ 
verter of signals from one form, into another Also, for both the fundamental 
frequency and the amplitude controls, it is useful to have the capability for 
superimposing on 'main 1 parameter controls signal perturbations which effec¬ 
tively produce vibrato, trill, tremolo, and the like 

The front panel layout of the Harmonic Tone Generator (Figure 2,9) is 
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Figure 2.9 Harmonic Tone Generator Front Panel 
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divided logically into seven columns—one each for frequency, first harmonic, 
second harmonic, etc. Knobs for manual control of the parameters and for 
attenuation of the harmonic outputs, and connectors for parameter control 

inputs, individual harmonic outputs, and summed outputs are mounted on the 

panel. 

In summary, the specifications for the Harmonic Tone Generator, which 
have been verified by measurement, are: 

1, Number of Harmonics: 6 

2, Input impedances: 100 K or greater on all inputs, 

3, Input voltage ranges' (a) -10 to f 10 volts on amplitude inputs. 

(b) 0 to -11 volts on frequency input, 

(c) 0 to -16 volts on second harmonic phase input. 

4, Modulation frequency range: 3KC mm. on all inputs except second harmonic 
phase which is 200 cps, 

5, Output range: (.a) Fundamental frequency —0 to 2400 cps. 

(b) Harmonic amplitudes —„45 to 250 millivolts rms, 

(c) Second Harmonic Phase — 180°, 

6, Linearity of amplitude control: within 2 per cent max. 

7, Total distortion of individual harmonics: less than 2 per cent. 

8„ Amplitude "zero leak'': 55 db below full output. 

Frequency stability: 2 cps, or .5 per cent whichever is greater for 
constant temperature (72°F). 


9. 
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10. Output impedance: 30 or 600 ohms. 

For many technical reasons, these specifications do not live up to the 
requirements outlined earlier in this chapter. However, there are no theo¬ 
retical limitations and we expect to improve these characteristics. In¬ 
creasing the number of harmonics and installing phase controls for all of 
the harmonics is primarily a scale-up problem. On the other hand, technical 
refinement is needed to improve the frequency stability, frequency vs 
control voltage curve, and amplitude zero leak. 
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Chapter 3 

CIRCUITRY FOR THE GENERATION AND CONTROL OF 
PARAMETER VOLTAGES 

3.1 Introduction 

The electrical devices which are briefly described in Section 1.1, items 
(2) through (6), may be conveniently discussed as pertinent to the control of 
musical parameters. These devices were constructed primarily for the purpose 
of generating DC voltages to be used for the time-envelope control of the har¬ 
monic amplitudes and fundamental frequency of the Harmonic Tone Generator. 
However, as is mentioned in Sections 3.5c and 4.1, they may be used in con¬ 
junction with other devices which have been built or acquired by the Experi¬ 
mental Music Studio. Figure 3.1 shows a simplified block diagram of the param¬ 
eter control system and the possibilities for switching between the various 
units, including the Time-Envelope Networks, the Time-Delay Gate Networks, 
the keyboard, and the ^O-DC converters. The analog multiplier, though it is 
used for parameter control, is not shown because it is functionally independ¬ 
ent of the other units and not attached to the network of switches. 

It is possible to build a device which will generate over a finite inter¬ 
val of time a more or less arbitrary DC signal. Two methods for accomplishing 
this are the photo-waveform method, by means of which an envelope signal can 
be produced corresponding to a wave form traced upon an oscilloscope screen, 
and the universal function simulator method, whereby the envelope signal is 
specified (by adjustment of appropriate controls) at a number of discrete points 
within the finite time interval. Although from the scientific investigator's 
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Figure 3.1 Parameter-Control Voltage Synthesizer Simplified Block Diagram. 
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point of view the ability to specify arbitrary parameter contours is highly 
desirable, there are two principal reasons why I decided, for the present, 
not to use these types of units in favor of a less elaborate system for control 
voltage synthesis. First, since in general several distinct parameter voltages 
are required as inputs to the Harmonic Tone Generator—and even more will be 
required if this instrument is to be expanded m the future—, the cost of a 
set of arbitrary function generators, which are individually expensive would 
be high. Second, the setting-up of several arbitrary parameter contours using 
the methods described would be both time-consuming and clumsy. In electronic 
music, these are important considerations. Consequently, I decided to build 
a set of modules which would allow the control of the following "sub-parameters" 
which are believed to have perceptual significance for music: 

(1) t , the delay time, i.e., the time of beginning of the envelope 
signal relative to an arbitrarily placed time origin. 

(2) T , the attack time, i.e., the duration of the attack transient. 

(3) t , the decay time, i.e., the duration of the decay transient. 

u 

(4) A, the relative amplitude of the envelope signal. 

(5) p(t), an oscillatory perturbation signal whose amplitude and frequency 
are described by their own sets of sub-parameters. (Details are given in Section 
3.5.) This signal is generally added to the "mam" envelope signal. 

These units have proved very useful for providing a variety of attack and 
decay characteristics. The outputs of these units can be added or subtracted 
in order to construct more sophisticated parameter contours. On the other hand, 
prior to experimenting with this equipment, I had hoped that, as a by-product, 
it would be possible to simulate the attack characteristics of some of the 
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ordinary musical instrument tones. Thus far, it does not seem to be possible 
to do so with any degree of accuracy. Hence, it must be concluded that other 
sub-parameters which have not been taken into account by this system are needed 
to describe these tones. 

In several of the mathematical expressions which follow the convenient 
mathematical "step function" U(x) is used. It is given by 

x > 0, (3-1) 

x < 0. 


Simple modular control circuitry is now in operation which converts a 
voltage step input of either polarity and of at least eight volts magnitude 
(i.e., = V U(t), where V > 8) to a set of six outputs each having the 

form 

-t ’A 

V ^ = + 9A U(t') [1 - e ], (Attack Mode); (3-2a) 

OU o ■ 

f -[1/2T 2 - 7l/(2T 2 ) 2 + 1 /t| ] t ’ 

V = + 9A-U(t’) <e 

out - | 

-[I/2t 2 +-Jl/(2T 2 ) 2 + l/Tg ]t ' *) ~t'/r 3 

- e ) e 

j 

-t'/T 2 -t '/T 

= + 9A U(t')(l - e ) e for t « t (Decay Mode); (3-2b) 

where t' = t - and t is the time variable. 

T , the delay time, can be varied continuously in three ranges: 0 to 50 ms., 

0 to 500 ms., and 0 to 5 seconds. The attack time, t varies in steps of 0, 2, 

A 

3, 5, 10, 20, 30, 50, 100, 200, 300, 500, and 1000 ms.; the decay time, t in 

O 


U(x) = 


3.2 Time-Envelope and Gate Control 
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steps of .02, .04, .08, .12, .2, .4, .8, 1.2, 2 and 4 seconds. A is the 

relative amplitude of the envelope signal and can be adjusted between 1 and 
.1. Switches are provided for setting the outputs to either polarity and to 
either the Attack or the Decay Mode. 

An input amplifier circuit accepts two DC inputs, one positive and one 
negative. If either of the two inputs exceeds 8 volts in absolute magnitude, 
the output of the amplifier is +12 volts, whereas if they are both zero, the 
output is zero. The output of the amplifier is then applied in parallel to 
the inputs of six "Time-Delay Gate (T.D.G.) Networks". The function of each 
of these networks is to produce a step voltage output which is time-delayed 
by an adjustable amount with respect to the input and to turn on an arbitrary 
external signal in synchronism with the step voltage output. When the input to 
these networks is zero, the outputs are also zero and the external signals are 
turned off. The schematic for one T.D.G. Network is given by Figure 3.2. 

The R-C network at the input of the first transistor stage of the T.D.G. 
Network is arranged so that the input current builds up at a rate determined 
by a variable resistor and a shunt capacitor until this current reaches a thres¬ 
hold value and the two relays simultaneously are turned on. One relay is used 
to switch the output from zero volts (ground) to either positive or negative 
18 volts, and the other is used to turn on the external signal. When the input 
voltage drops to zero, current conducted by the silicon diode in shunt with the 
variable resistor rapidly discharges the capacitor, and the relays return to 
their normal positions. The time delay is actually quite independent of the input 
voltage, providing it is equal to or greater than 12 volts, because the zener 
diode limits the R-C network input voltage to this value. Resistors R R , R , 

1 « j 




i 


Figure 3.2 Time-Delay Gate Network Schematic 
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and R^ are adjusted to equalize the firing times of the two relays and to set 
the maximum time delay to exactly 5 seconds. The time delay due to the action 
of the relays themselves is approximately 2 ms. 

The step voltage output of each T.D.G. Network is normally applied to the 
input of one of six "Time-Envelope (T.E.) Networks", the schematic of which is 
given by Figure 3.3. Note that the networks can be set to Attack or Decay modes 
for either positive or negative step voltage inputs. Again diodes are used to 
speed up the discharge of the capacitors so that the networks will respond 
correctly to repetitive actuating inputs. The proper load resistance for these 
networks is 100 kohms, which is the standard input impedance of the Harmonic 
Tone Generator. Assuming that it is driven by a T.D.G. Network, the general 
forms for the output of a T.E. Network for both the attack and decay modes 

g 

are given by Equation (3-2). Normally, if A = 1, the attack time is t = 2x10 C , 

4 Ct 

The complexity of the expression for the Decay Mode output envelope signal, as 
given by Equation (3-2b), is necessary for an accurate definition in terms of 
these values for T and T . The variable series resistor connected to the output 
serves to attenua-te the output signal by a factor not exceeding 10. Simultane¬ 
ously, the attack time is increased to a maximum of 1.8 times its normal (zero 
attenuation) value, and the decay time is increased to a maximum of 5 times 
its normal value. 

Thus far, I have described only one out of the several input-output con¬ 
figurations of the T.D.G. and T.E. Networks which are possible. When all the 
inputs of each of the individual T.D.G. Networks are connected to the output 
of the input amplifier, the T.D.G. Networks are in "Parallel Mode." However, 
a switch is provided to place these networks in "Series Mode", whereby network 
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1 is driven by the input amplifier, but network 2 is driven by the voltage 
step output of network 1, network 3 by network 2, and so on. Either one or 
the other of these two modes may be the best one for accurately producing a 
rhythmic sequence of time delays, depending on the particular spacing of the 
sequence. The T.D.G. Networks can also be set in "independent Mode", m which 
case the individual networks can be actuated directly by six independent 
sources. Finally, they can be set in "AC Mode", whereby the outputs of two 
AC-DC converters are connected to the inputs of two independent sera of three 
T.D.G. Networks (the AC-DC converter is described in Section 3.4). 

In a similar fashion, the Time-Envelope Networks can be set in "Time-Delay 
Output Mode", as described directly above; or "DC Actuate Mode", in which case 
the T.E. Networks are connected directly m parallel to the input amplifier 
inputs, i.e., the T.D.G. Networks are by-passed. The T.E. Networks may also be 
set m "AC Mode" or "independent Mode" in which cases the inputs to these 
networks are exactly the same as those given for the corresponding modes of the 
T.D.G. Networks. 

The two inputs of the input amplifier can be set at any one of three 
positions: (a) "External Source", (b) "Manual" (a pushbutton is provided to 
actuate the system), or (c) "Keyboard Output" (described in Section 3.3). 

The six independent envelope signals may be applied to the Harmonic Tone 
Generator amplitude control inputs in order to program a completely different 
attack-decay characteristic for each harmonic. Also, since each harmonic ampli¬ 
tude is proportional to the sum of three separate control inputs to the Harmonic 
Tone Generator (See Figures 2.2 and 2.9), it is possible to program amplitude 
contours which are quite different from that depicted by Equation (3-2) . For 



example, the following envelope characteristic can be programmed using two 
Attack Mode envelope signals of opposite polarity: 
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...» r , l " (t ' T ib )U(t ' T lb )/T 2b 

U(t) 1 - e e 


* *» ✓._ v f i " t/T 2a1 . i f i ~ (t T lb )/T 2b 

= U(t) 1 - e - U(t - t ) 1 - e 


f ° r T lb >> T 2a 


(3-3) 


The expression on the left side of Equation (3-3) represents a signal which 

is characterized by an attack at time zero of duration , a steady state 

which lasts until t equals t and an exponential decay which begins at t = t 

Id lb 

with decay time constant T . As is indicated by the right side of Equation 

u D 

(3-3), this more complex envelope contour can be derived from the addition of 
two simple attack envelopes, the second being of opposite polarity and time- 
delayed with respect to the first. Other interesting envelope characteristics 
can be similarly constructed. An "overshoot" characteristic can be obtained by 
the addition of a short decay envelope to one of longer duration. 

When applied to the frequency control input, one of these signals will 
produce a frequency glide or portamento. In conjunction with an analog multi¬ 
plier (as described in Section 3.5) they can also be used to "shape" musical 
parameters such as vibrato and tremolo. 

3.3 Keyboard Pitch and Rhythm Control 

A convenient source of a pitch control voltage and simultaneously produced 
actuate voltage is an electronic organ keyboard which has been wired for a 
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series of adjustable pitches. When a particular key is depressed, a switch is 
closed making contact between a fixed reference voltage and one of a large 
parallel group of variable resistors (one for each key) which are all termin¬ 
ated by a fixed resistance (see Figure 3.4). The variable resistor of each 
voltage divider thus created is adjusted so that the output voltage, which is 
applied to the Harmonic Tone Generator frequency control input, determines 
that the frequency correctly corresponding to this key is generated by the 
Harmonic Tone Generator. The proper output voltage is determined by Figure 2.5. 
The pitch control voltage varies from -.7 to-11.5 volts corresponding to 

pitches between C and E when keys are depressed, but is zero when no key is 
& 7 

depressed. When playing this type of "parallel tuned" keyboard, the performer 
must be aware that two keys depressed will produce a pitch higher than both of 
the pitches corresponding to each of the keys. 

A+12 volt actuate voltage is derived from the pitch control voltage by 
means of the limiter-amplifier circuit as shown. The operational amplifier is 
set for a gam of 12/-.7 = -17 causing the output voltage to limit at about 

12 volts, its maximum output. Nevertheless, zero pitch control voltage produces 
a zero actuate voltage within a close tolerance. In this way both the pitch 
control voltage and the actuate voltage are produced from single contacts and 
are automatically synchronized. The actuate voltage, normally used to initiate 
attacks, must be reset to zero by lifting each key before depressing the next 
one. The capacitors m the T.D.G. and T.E. Networks must be discharged before 
a subsequent transient can be produced. Strict legato with no attack transients 
and constant intensity control can be performed only with the T.E. Networks 


set on Attack Mode. 
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Figure 3.4 Keyboard Schematic. 
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3.4 Rhythm Control by Tape and AC-DC Converter 
- ! 

A circuit which converts tone bursts whose frequencies are 5 kc or greater 
into DC actuate voltages has also been constructed, and another identical 
circuit is to be completed shortly. Two simultaneous sequences of tone bursts 
recorded on two-channel tape will be then used to actuate two sets of time 
delay and envelope controls, or they can be used to gate sounds according to 
the rhythmic pattern of the tone bursts. The circuit, as shown in Figure 3.5, 
consists of a limiter, gate, full-wave rectifier, low pass filter, and output 
amplifier. The rise time of the output voltage step is about 1 ms. By choosing 
the tone burst frequency to be 7000 - f , where f is the desired audio funda- I 

mental, the tone bursts can be used to simulate the Harmonic Tone Generator 
voltage-controlled oscillator. (An input is provided for this purpose, see 
Figure^2.2 and 2.9). Hence, the 1- same pitch and rhythm sequence can be repeated 
for different manual settings or variations of the attacks, harmonic amplitudes, 
frequency modulation, and other parameters. 

Rhythmic patterns of tone bursts may be recorded on magnetic tape by two 
methods. First, sections of tape containing a pre-recorded high frequency tone 
and sections of blank tape may be spliced together according to the desired f 

i 

j 

pattern. In certain circumstances this method may offer a modest advantage i 

' i 

over the usual procedure of splicing and measuring tones since it is not 
necessary to search out and splice at the beginnings of tones already recorded. 

Second, a digital computer can be programmed to produce square wave tone bursts 
according to an arbitrary time pattern specified as data for the program, and 
these tones can be recorded on magnetic tape. Composers have used tones generated 

! 

i 

by the University of Illinois CSX1 Computer at the Coordinated Science Laboratory i 


for this purpose. 
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3.5 Analog Multiplier 

A compact device which produces an output signal proportional to the 

instantaneous arithmetic product of two DC signals, irrespective of their 

polarities, was needed for the synthesis of more complex parameter contours. 

It should be noted that the two types of amplitude modulation devices used in 

most electronic music studios will not satisfactorily perform this function. 

One of these, the "ring modulator", is restricted to multiplying AC signals 

over a comparatively small dynamic range. The other type, the "voltage-controlled- 

gain amplifier", is restricted to the multiplication of a bi-polar signal and 

a mono-polar signal and, in addition, may have a non-zero DC offset voltage. 

Therefore, a special analog multiplier was designed and constructed which takes 

the two input voltages e and e , and multiplies them to produce the output 

X dt 


e 


out 


K e x e 2 . 


(3-4) 


This multiplier uses a carrier frequency method to derive the product of 
two signals. The first signal controls the amplitude of an ultrasonic frequency 
square wave, and the second signal controls the phase of another square wave 
of the same frequency. The two square waves are then compared by means of a 
phase detector. To be specific, one square wave with amplitude proportional 
to e^ is switched on and off in synchronism with the positive and negative 
portions of the other square wave which drives the phase detector. This latter 
square wave is generally out of phase with the former, and the deviation from 
90° of the phase difference between the two is varied by the control voltage 
e . Figure 3.6a shows the output waveform (before filtering) for six combinations 
of values e^ and e^. Given that the peak-to-peak voltage of this wave form is 
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e and the phase shift is TI/2 - ke , it follows that the average DC com- 
1 2 ^ 

ponent of the waveform is 


e = ke e /27T . 
out 1 2 


(3-5) 


The block diagram for a "phase shift" multiplier is given by Figure 3.6b. 


а. A Practical Phase Shift Multiplier 

Some basic requirements with respect to use of the multiplier for param¬ 
eter control as well as for general electronic music purposes are the following: 

1. Ideal input-output relationship 1 e out = K e i e 2' 

2. Bandwidth (amplitude control): 3 kc. 

3. Bandwidth e (phase control): 10 kc. 

4. Maximum DC input or output. + 10 volts. 

5. AC feed-through with = 0. 70 db below maximum output. 

б. AC feed-through with e = 0 40 db below maximum output. 

7. Maximum DC offset voltage at output: 10 mv. 

8. Distortion of input sine wave when other voltage is set to 10 volts 

1 per cent maximum. 

9. Input impedances* 100 kohms. 

10. Outputs: one DC output drives 10 volts into 10 kohms and one AC output 
drives 5 volts rms . into 600 ohms. 

The heart of the phase shift multiplier is the pulse shaping circuit 
shown m Figure 3.7. The output of the integrated chopper at the top of the 
diagram is a square wave alternating between 0 and -e , and the voltage devel¬ 
oped between the outputs of the top and bottom choppers is a balanced square 
wave alternating between -e^/2 and +e^/2. At the input to the output differential 
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Fxgure 3.6 Phase Shift Multiplier: (a) Output Waveforms 
(Before Filtering) (b) Block Diagram. 
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Figure 3,7 Phase Shift Multiplier: Pulse Shaping Circuit Diagram. 
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amplifier the voltage across the phase detector chopper alternates between 
this square wave and 0 volts according as the phase detector is positive or 
negative, respectively. Hence, the waveforms depicted in Figure 3.6a result 
and the low frequency output of the amplifier is ke e /27T. The bottom chopper 

X A 

is not used to switch the signal but is used for error current cancellation. 
When is zero, a series of ’’spikes’’ remains at the output of both choppers. 
However, since in this mode the chopper circuit configurations are identical, 
the spikes are essentially cancelled across the phase detector. This results 
in an improvement of the AC feedthrough with e^ = 0 by about 20 db. 

The method used for phase modulation is illustrated by the block diagram 
shown m Figure 3.8. The output of a 100 kc carrier frequency oscillator is 
AM-modulated by means of a chopper circuit (see Figure 2.8) and a second order 
Butterworth band pass filter with 20 kc bandwidth to produce an m-phase sine 
wave with amplitude e /2. The oscillator output is also shifted by 90° to 

<u 

produce a cosine wave with a fixed 2.5 volt amplitude. When these two quad¬ 
rature components are added, the phase of the resulting sinusoid is equal to 
7T/2 - tan ^e /5) relative to the phase of the oscillator output. However, if 
the magnitude of e is less than 2 volts, this relationship is linear within 
2 per cent, i.e., the phase is approximately equal to 77 /2 - e /5. [The linear 

a 

range for e^ could be extended to +10 volts if the buffer amplifier were pre¬ 
ceded by a diode function generator designed to deliver the output 5tan(e /5)]. 

2 

This relationship is also inherently symmetrical about 71 /2 for positive and 
negative values of eIt follows that k in Equation (3-5) is 1/5, and the 
output of the differential amplifier shown m Figure 3.7 is 

e 1 tan“ 1 (e 2 /5)/2 7T = e^/lOOT, for e g < 2. 


(3-6) 
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However, it was decided that the magnitude of the output of the entire multi¬ 
plier should be 10 volts when both inputs are ± 10 volts. Hence, the gain of 
the output amplifier (Figure 3.6) is adjusted to 27T/tan *(2), so that the 
relationship given by Equation (3-4) is valid for K = l/(5tan 1 (2)) and for 
|e^| less than 2 volts. 

Unfortunately, two anomalies of the circuitry and the technique limit 
the performance of the constructed multiplier with respect to satisfying require¬ 
ment 6, namely, the AC feed-through with e = 0, especially when e contains 
higher frequencies. The performance was improved by compensating for these 
anomalies at the input of the multiplier output amplifier. The maintenance of 
zero DC output when e = 0 depends on the cancellation of the areas of the positive 

A 

portion and the negative portion of the pulse wave [See Figure 3.6a center]. The 
response of a simple integrated chopper circuit [Figure 2.8] to both +10 and -10 
volt DC control voltages was measured, the rise and fall times of the output 
square wave for the positive control voltage were 320 ns. and 400 ns., respec¬ 
tively, whereas for the negative control voltage they were 200 ns. and 150 ns., 
respectively. The result of this asymmetrical response was that the output for 
e 2 = 0 had roughly the form 



This output error was partially cancelled by the introduction of opposite 
polarity voltages corresponding to these two terms. 

Also, when e.^ contains higher frequencies, the approximation that the 
pulse waves derived from sampling e.^ are square is no longer valid. The error 
in cancellation increases with increasing frequency of e . Therefore, an opposite 
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polarity version of which increases in amplitude with frequency (obtained 
by feeding the signal through a capacitor of optimum value) was used to compen¬ 
sate this error at the output, and a substantial improvement resulted. 

Some performance characteristics of the practical multiplier which have 
been measured are: 


(1) AC Feed-through 

frequency (cps.) A(db) B(db) Conditions: 


50 

-75.5 -42 

A. e = 0, e = 2v.rms. 

500 


. reference to e^ = -10 v., 

l; ooo 


e 2 = 2 v. rms . 

2,000 

-58 


3,000 

-55 

B. e = 2 v . rms., e = 0 

X * 

4,000 

-53 

reference to e^ = 2v.rms., 

5,000 

-52 

e 2 * 1 10v ' 

10,000 

-51.5 



(2) Harmonic Distortion (f=50 cps., e.^ = +10 v.) 


e (volts rms) 

Ct 


% distortion 
output 


at 


.5 

1.0 

1.5 

2.0 

2.5 


.54 
.68 
1.1 
1.6 
2 .5 


3.0 


3.4 
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b. Parameter Control by Multiplication 

An envelope signal may be considered to consist of two terms: (1) a smooth 
"average" envelope function A^(t) such as that given by Equation (3-2), and 
(2) a tremolo perturbation function p(t) with zero average value. An envelope 
characterized by a changing percentage perturbation can be produced by multi¬ 
plying a suitable envelope function A 2 (t) with a readily available steady-state 
low frequency signal s(t) having a constant rms . amplitude (such as a sine 
wave or filtered white noise). Hence, the total envelope signal in this case 
is 

A T (t) = A x (t) + p(t) = A 1 (t) + A 2 (t).s(t). (3-8) 

For example, the envelope for a simulated "vibraphone" tone can be constructed 

_t/T 

by taking A (t) = 0, A (t) = U(t)e , and s(t) = sin(27Tft) where t and f 

are the order of 1 second and 5 cps., respectively 

A (t) = U(t)e -t sin(l(Wrt). (3-9) 

T 

Analogously, the multiplier may be used to vary the amount of vibrato with 
respect to time. In this case, the "main" pitch control voltage is complemented 
with a vibrato perturbation signal of the same type depicted above. As an example, 
suppose that the Harmonic Tone Generator pitch control voltage is given by 

V f (t) = V q + A 2 (t) • s(t). (3-10) 

Then, providing A„(t)*s(t) = p(t) is sufficiently small, the instantaneous 
fundamental frequency at the output of the Harmonic Tone Generator is given by 
Equation (2-1) to be 

d a 

f(t) = q[ V f (t) ] = q(V Q ) + . A 2 (t)-s(t). 


(3-11) 
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If A (t) is an increasing function with respect to time, a tone can be created 
/!* 

which is initially "flat" and is gradually transformed into a tone with definite 
vibrato. 

c. Other Uses for Multipliers in Electronic Music 

Since the inception of electronic music, the transformation of spectrums 

from one pitch region to another has played an important role in electronic 

music composition. Usually this has been accomplished by the so-called "ring 

modulator", a multiplier of AC signals which derives its name from its circuit 

24 

configuration—a ring of four diodes. A sound, having a simple, changing 
harmonic spectrum and a distinctive attack-decay characteristic, which is multi¬ 
plied by a pure sinusoid can be transformed into a sound with a more complex, 
and usually inharmonic, spectrum retaining the original attack-decay pattern. 

If the spectrum of the original sound contains the frequencies f , 2f 3f ... 

ill, 

and the frequency of the sinusoid is f , the output of the multiplier is a 

a 

double side-band spectrum consisting of the frequencies ..., f -f , f -2f , 

6 1 a 1 

f +f , f +2f f ... The amplitudes of these frequencies are proportional to those 
of the corresponding harmonics of the original signal. Under these circumstances, 
it is essential that the frequency f not be present in the output spectrum 

a 

since its constant value represents an undesirable aural redundancy. The appear¬ 
ance of the "carrier" frequency f may be due to an mbalance m the modulation 

a 

with respect to the original sound or by a direct feed-through of f 2 to the 
output. Therefore, an analog multiplier can accomplish spectrum transformation 
satisfactorily only if it is properly balanced with respect to these two items 
and has adequate frequency response. 

24. Wilcox, R.H. • "Use of a Diode Ring as a Four Quadrant Multiplier," Review 
Scientific Instruments , 30, 1009-1011, 1959. 
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The multiplier may also be used for the instantaneous amplitude control 
of an arbitrary sound signal, where the amplitude control voltage is supplied 
by one of the Time-Envelope networks described m Section 3.2. Here it is 
important that the modulation be linear with respect to the original signal 
and that none of this signal feed through to the output uncontrolled. As 
discussed in Section 2.4c, a large dynamic control, range, at least 70 db, is 
needed to reduce the signal feedthrough (or "leak") to a tolerable level. 

d. Some Other Possible Designs for a Electronic Multiplier 

It is ironic that a signal accurately proportional to the sum of two elec¬ 
trical signals can be produced by so simple a circuit as the mere connection of 
two resistors, and yet the generation of an accurate product of the two signals 
inevitably requires a much larger number of components. Analog computation 
usually depends on the availability of natural devices which follow simple 
mathematical laws (a case in point: the capacitor as an integrator of current), 
but there does not seem to be in nature a simple, inexpensive, accurate device 
for multiplication, At this point, it seems desirable to review some other 
possible methods of electronic multiplication, besides the "phase shift" method, 
which were considered. 

Probably the most popular electronic multiplier design uses the "quarter- 
square" method. This type of multiplier consists of two squaring circuits and 
sum and difference amplifiers; hence, the output is given in terms of the input 
by the expression 

e l , e 2 = 1/4 [(e^+eg) 2 - (e x - e 2 ) 2 ] (3-12) 


A square law characteristic may be constructed as a linear-segment approximation 
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25 

to a parabola according to a diode function generator design or by taking 

26,27 

advantage of the natural square law behavior of certain electronic devices. 

The latter technique is hampered by the fact that none of the several physical 
devices used have true square law characteristics over a sufficiently large 
dynamic range. With the former technique, large dynamic range and accurate 
performance can only be achieved if large numbers of diodes and precisely 
adjusted resistors are used. Also, in this case, the derivative of the approxi¬ 
mation to a parabola contains many discontinuities which for certain types of 
input signals can result in the superposition of an undesirable high frequency 
"noise*' on the output signal. Both techniques involve stringent balancing 
requirements. 

Control voltages can be used to vary the gain of amplifiers or to vary 

voltage-sensitive resistive components (such as the field-effect transistor 

biased in the "pinch-off" region). Here again, dynamic range and balancing 

requirements restrict this technique for four-quadrant operation. 

In recent years, progress has been made in ultilizing magnetic fields to 

obtain multiplier action m conjunction with the Hall-effect and the magneto- 

27 

resistance effect m semiconductors. For example, the output voltage of the 
Hall-effect multiplier is proportional to a flux density B and a Hall element 
current I , while B is proportional to the current I in a coil which produces 
the magnetic flux. However, the main difficulty with this method is that in 

25. Marshall, B.O. : "An Electronic Multiplier", Nature , 167 , 29, 1951, 

26. Berger, F.B,, MacRae, D.: "Mathematical Operations on Waveforms-Il", 
Waveforms , chap, 19, 679-686, M,I.T, Radiation Laboratory Series, Vol. 19. 

27. Hilsum, C.: "Multiplication by Semiconductors," Electronic Engineering 
30 (369 ), 664-666, 1958. 
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order to maintain a constant flux density over a wide frequency range, either 
the coil drive voltage must be very large for high frequencies (when large 
inductances are used) or the constant coil drive current must be^ very large 
(when small inductances are used). This means that the maximum required 
signal power at the coil input is proportional to the highest signal fre¬ 
quency. At present, these power requirements are too severe for wide band 
applications where a reasonably large output signal is required (1 volt). 

Hence, the magnetic multipliers are probably best suited for low frequency or 
power applications. 

Carrier frequency methods of multiplication seem more promising, 
since techniques for nonlinear approximation and simulation are replaced by 
techniques for the control of sawtooth waveform linearity and the areas of 

•s. 28 

square pulses. The average DC component contributed by a square pulse wave 
is equal the product of the pulse height and the pulse width divided by the 
pulse period (a constant). Hence, if voltages e and e control the height 
and width of a recurrent square pulse, respectively, the average component is 
proportional to their product. (The high frequency pulse components are removed 
by a low pass filter ). This can be accomplished by means of an electronic 
switch which turns on the voltage e^ only when the switch actuating voltage is 
positive. If the sum of a sawtooth or triangle waveform (normally entirely 
negative) and the voltage e is applied to the actuate input, the switch is 
turned on for a time proportional to e , for positive values of e This is 
inherently a two-quadrant multiplier, although a four-quadrant multiplier can be 


28. Kraicer, A.: "Two Four-Quadrant Electronic Multipliers," Radio and Electronic 
Engineer . 25(1) , 5-11, 1963. 
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formed either by positive biasing of e at the switch actuate input, forming 
the product e (k+e ) = ke + e e , and subtracting ke at the output, or 
by applying -e^ and -e^ to an otherwise identical electronic switch and adding 
the outputs of the two switches. 

A similar type of carrier frequency multiplier, which is inherently a 
four-quadrant device, is a "phase shift" multiplier. A phase shift multiplier 
was built for use in the parameter control system and is described in Section 


3, 5a. 
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Chapter 4 

VARIABLE FORMANT FILTER (PROGRAM EQUALIZER), THEORY AND DESIGN 
4.1 Introduction 

A variable formant filter was designed and constructed, which is capable 
of assuming a large variety of shapes (frequency transfer characteristics) 
over a fixed frequency range. It can be used for studies of the audible effect 
of harmonic tones with different formant characteristics. The frequency range 
of the filter is partitioned into a number of band regions which are logarith¬ 
mically spaced on the frequency scale. Ideally, formant curves are to be approx- 
mated m staircase fashion by altering the intensity of each band to coincide 
with the corresponding point on the formant curve'. Physically, this is accom.-< 
plished by a device containing a set of band pass filters whose center fre¬ 
quencies are logarithmically spaced (a fixed number per octave of frequency) 
and whose transfer characteristics are logarithmically symmetrical and identical 
when scaled to the same center frequency. The signal to be filtered is applied 
in parallel to each of the filter inputs, each filter output is connected to 
the input of a variable attenuator, and the total output of the device is formed 
by the sum of the individual attenuator outputs. (See Figure 4.1.) Electronic 
instruments of this type have been used in the broadcasting industry for some 
years and are called "program equalizers". However, data for the design of such 
instruments do not seem to be available. 

As I discussed m Section 1.5, a voltage-controlled subtractive synthesizer 
can be formed by the connection of the output of a voltage-controlled oscillator, 
whose waveform contains a large number of harmonics, to the input of a formant 
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filter, whose shape can be varied by means of control voltages. The shape control 

can be implemented using the variable formant filter just described, since this 

device can be easily modified for the functional replacement of the variable 

attenuators with voltage-controlled amplifiers. Fortunately, Robert Moog has 

recently designed and made commercially available voltage-controlled oscillators 

29 

and amplifiers for use in the electronic music studio. Moog's oscillators 
provide a choice of sine wave, square wave, pulse wave, sawtooth wave, and tri¬ 
angular wave outputs, which are basic waveforms for subtractive synthesis Also, 
the voltage-controlled amplifiers in conjunction with time-envelope voltages 
supplied by the equipment described in Chapter 3 can be used to provide attacks 
on sounds produced by subtractive synthesis. Hence, these units complement very 
nicely the additive synthesis system described m Chapters 2 and 3. The 
Experimental Music Studio has acquired some of these units and they are presently 
being used in conjunction with the Variable Formant Filter to be described in 
this chapter. 

It is evident that the number of degrees of freedom for synthesizing a 
given frequency characteristic with a variable formant filter is proportional 
to the number of filters per octave (i.e., the filter density) which is employed. 
However, economic considerations and the need for a wide audio frequency range 
(40 cps. to 16,000 cps.) limit the practical density of filters to a low number 
per octave. In this chapter 1 will concentrate on finding the "best" low-order band 
pass response characteristic which is compatible with a given filter density. 

29. Moog, R.A. : "Voltage-Controlled Electronic Music Modules," J. Audio Eng. 

Soc. , 13(3) , 200-206, 1965. 
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The calculation of the composite response of the formant filter is complicated 
due to the necessity of taking into account the phases as well as the ampli¬ 
tudes of the band pass filter outputs. 

4.2 A"Best" Criterion for Formant Curve Simulation 

Arbitrarily, a criterion has been chosen such that the response of the 
adjustable formant filter corresponds closely as possible to that which Is 
implied by the settings of the attenuators for two extreme cases: 

(1) All attenuators set for unity gain (0 db); then the output should be 
flat over the range of the filter system. 

(2) A single midband attenuator is set for zero gain (-» db); in this 
case, the output at the corresponding midband frequency should be 
zero. 

These conditions correspond to characteristics which are useful in musical 
applications: (1) Equal transmission of all frequencies ("all-pass" character¬ 
istic ). (2) Rejection of one or more discrete frequencies ("notch" character¬ 

istic). The third characteristic which is useful—boosting of a single fre¬ 
quency—is Inherent in the type of system chosen. The maximum amount of boost 
at one frequency relative to the others is, of course, obtained when all the 
attenuators are set to zero gain, except the attenuator connected to the filter 
passing this frequency, which is set for unity gain. 

The formant characteristics of ordinary musical instruments are characterized 
to a high degree by resonances and antiresonances, i.e., distinct maxima and 
minima. Therefore, from the standpoint of their simultaion, it is advantageous 
to have a system capable of producing sharp peaks and notches at discrete fre¬ 
quencies . Also, since each value of the overall frequency characteristic varies 
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continuously as a function of the posric-ons of the attenuators (with those atten¬ 
uators corresponding tc the filter band centers nearest to this value having the 
largest influence), it is evident that an infinite variety of formant character¬ 
istics can be simulated and, to an accuracy of one bandwidth and with a certain 
restriction on the maximum slope of the curve, any desired formant shape can be 
simulated. A qualitative notion of the maximum slope restriction is indicated by 
the peak and notch characteristics of the filter set. 

4.3 Computation of Ripple and Notch Characteristics of an Infinite Set of 

Band Pass Filters 

The mid-region frequency response of a large set of band pass filters 
with unattenuated outputs summed together is very close to that of an infinite 
set of such filters, assuming that we can neglect the contribution of the filters 
whose center frequencies are sufficiently removed from the mid-region. Because 
of the logarithmic symmetry of the filters under consideration, their congruent 
shapes, and their even spacing on a logarithmic frequency scale, the infinite 
set frequency characteristic is logarithmically periodic. The period is equal 
to the spacing between the filter band'centers. Also^the response has mirror 
symmetry about the band center frequencies and the geometric means of adjacent 
band center frequencies. Hence, the entire response function is characterized 
by a single half-frequency interval between one of the band centers and an ad¬ 
jacent geometric mean frequency. 

It is shown below that among other properties of an infinite filter set, 
the complex response function is real at the half-interval end points and that 
relative maxima or minima of the magnitude function occur at these points. 
Evidently, when the bandwidths of the individual filters are sufficiently small 
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(assuming their own maxima are at tand center), the total response maxima lie 
at the band center frequencies and the minimum values lie at the geometric mean 
frequencies between them. It is possible for sufficiently large bandwidths of 
filters of order greater than 1 (the tuned circuit) for other maxima and minima 
to appear. Nevertheless, over a good range of percentage bandwidth values, the 
extent of which is tested by a different method described later in this chapter, 
the maximum and minimum values can be inferred to be equal or close to the re¬ 
sponse function evaluated at a band center frequency and a geometric mean fre¬ 
quency. Therefore, the ripple, which describes the closeness of approximation 
to an all-pass characteristic, can be computed to be the difference in decibels 
between these two values. Likewise, the "notch" attenuation can be computed as 
the difference between this maximum decibel value at a band center and its value 
when the corresponding band pass filter is left out. The advantage of this method 
of determining the ripple and notch characteristics, is that for each filter type, 
value oi percentage bandwidth (or its inverse, Q), and filter density, only two 
real computations are required. 

In the remainder of this section of this chapter, some basic definitions 
are made and the fundamental properties of infinite "all-pass" filter sets out¬ 
lined above are proved. The response characteristic of each of the band pass 
filters of the set is derivable from that of an equivalent rational low pass 
filter response function, f(s). Parameters which are required to define the 
response characteristic of the infinite filter set are (1) the coefficients of 
the numerator and denominator polynomials of i(s); (2) the filter density N, 
and (3) the band pass filter Q. The "ripple" € and the "notch 11 attenuation 
a are computed by means of Equations (4-6) and (4-7), respectively. Curves of 
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£ vs. Q and £ vs. a with N as a parameter for three assumed forms for f(s) are 
given in Figures 4.2 through 4.7. 

a. Assumed Form of Low Pass Transfer Funtion f(s) 

Let us assume that: 

(1) f(s) is a rational function of the complex (frequency) variable s. 

(2) There exists a real value and a positive integer q such that for 
large values of |s|, f(s) is asymptotic to C^/s^. 

(3) The poles of f(s) lie in the left-half portion of the complex s plane. 

(4) f(o) = 1. 

Besides these technical assumptions, let us also assume that the low 
pass magnitude function [f(jw)| is one of the low pass types with a cutoff fre¬ 
quency, CO = 1. 

b. Properties of Band Pass Transfer Function g(y) 

The band pass transfer function with unity center frequency and percentage 
bandwidth 1/Q is defined by 

g(y) = f(jQ(y - i/y)> (4-1) 

which has real and imaginary parts Rg(y) and Ig(y), respectively. 

From the four assumed properties of f(s) and from Equation (4-1), the 
following three properties of g(y) can be derived: 

(5) g(y) = g*(l/y) from which it follows that g'(y) = -(l/y) 2 g*' (1/y) 

2 

- - (l/y) g'*(l/y). (* denotes complex conjugate). 

(6) g(l) = 1 implying that Rg(l) = 1, Ig(l) = 0. 

(7) Rg’(l) = 0. 

Proof of (5). (6). and (7): 


Since f is a rational function, it can be split into even and odd parts which 
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correspond to the real and imaginary parts of f(jw), respectively. Hence { 

^ f(jw) and g*(l/y) a f+[jQ(l/y - y)] ■= f[-jQ(l/y - y)] 

* 

= f[jQ(y - l/y)] = g(y), which establishes Property (5). 

Property (4) implies Property (6) since g(l) - g(0) - 1. 

The real part of f(jto) is even, implying that its derivative is odd and 
Re[f'(0)J - 0. Property (7) follows since Re[f*(0)] - Rg'(l). 

g(y) is a band pass filter transfer function with band center frequency 
y = 1. However, the argument of g can be scaled to produce a filter function 
with a band center at y i.e., g(y/y Q ). A set of congruent, evenly-spaced 
filter functions is given by 


h fe (x) .= g(x/p k ) for k = 0, 1, 2, 3, . . . ; -1, -2, -3, . . . ; (4-2) 

where p > 1. 

i/n 

If there are to be N filters per octave, p •= 2 
c. Properties of h (x) 

We can next establish the following properties of h^fx): 

(8) h R (x) = h* (1/x). 


(9) h k (p k ) = 1. 

(10) For m and n nonequal integers, h (p^ m+n ^ //2 ) ^ h 

m n 


= 2Rg(p ( “' n)/2 ). 

(11) h'(p (, " +n) ' /2 ) + h'(p ,,+n)/2 ) . 
m n 


j(2/p” , )Ig'(p ( - ,,+n,/2 ). 


(12) S h^(x) converges for all real 
k--oo 

oo 1 oo 

S h (x)( s 2 h»(x). 
k=-oo k J k——oo * 



x. 
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Proof of Properties (8) through (13): 

h k (x) a g(x/p k ) as g*(p k /x) =. g*[(l/x)p k ] = h* fc (l/x) establishes Property (8). 

h k (p k ) = g(p k /p k ) - g(l) - 1, which establishes Property (9). 

h (p <m+ “ ,/2 ) + h (P 0n+n>/2 ) . gCp- 1 "-^ 72 ) *■ g(p (,1 -“ )/2 ) = g.(p <m - ,l)/2 ) 

m n 

+ g(p^ m n ^ 2 ) = 2Rg(p^ m a ^ 2 ), which establishes Property (10). 

h'(x) + h*(x) = ~ [g(x/p m ) + g(x/p n )] - (l/p m )g , (x/p m ) + (l/p n )g'(x/p n ) 
m n ax 

= (l/p m )g'(x/p m ) - <p n /x 2 )g' + (p n /x); hence, hUp^ 11 ^ 2 ) + h^(p* m+n ’ >//2 ) 

= (l/p m )g , (p ( ' m+n)/2 ) - U/p m )g'*(p ( ' m+n)/2 ) * j(2/p m )Ig'(p ( “ m+n)/2 ), 

t* 

establishing Property (11). 

By comparison to the geometric series, the series of terms h (x) converges; 

JK 

that is, for large values of k, 

Q 

h ± . (x) - fljQU/p 1 * - p^/x)] ~--""5 = °[^/p q ^ k ]« which 

k [jQ<x/p ±k - p Vx)] q 

establishes Property (12). 

By Property (2), f(s), which is analytic in a deleted neighborhood of infinity, 
can be represented by its Laurent series, 

f(s) - C^/s 13 + Cg/s q+1 + . . ., 
and its derivative can be represented by 

f’(s) = -qC ;L /s q+1 - (q+l)C 2 /s q+2 + . . . 
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Consequently, if x is restricted to a finite closed interval not including 

q 

zero, if can be readily shown that h' (x) =: o[(l/pj ] for x in the interval, 

+k 

00 

and by the Weierstrass M-test, the series 2 h’(x) converges uniformly for 

k=-oo k 

all x in the interval. Hence, by a theorem on the differentiability of an infi- 
30 

nite series the series can be differentiated term by term at any finite non¬ 
zero point, establishing Property (13). 

d. Properties of Composite Transfer Response 

The "all-pass" transfer response of the infinite filter set is 

00 

H(<o) a S h (<o) (4-3) 

ks-w K 

Some properties of H(w) are: 


(14) 

H(<o) is 

log periodic, i.e. H(w) =. H(pu) . 


(15) 

ii 

/—s 

3 

N-' 

X 

H*(l/to) . 


(16) 

MH(,1) - 

00 

H(l)j =s 1+2 2 Rg(p k ). 

k-l 

(4-4) 

(17) 

MH(p 1//2 

) « 2 E Rg(p <k+1)/2 ) . 

k=0 

(4-5) 

(18) 

MH'(l) 

= 0. 


(19) 

1/2 

MH*(p ) .= 0. (i«e«, relative maxima or minima occur at 



<0 - 1 

. V2. 

and co - p ) 



°° k 00 k-l 00 k’ 

Proof of (14): H(pw) = 2 g(pco/p ) - 2 g(co/p ) = 2 g(co/p ) 

-oo -oo -oo 


30. Olmstead, J.M. ■ Real Variables , Appleton-Century-Crafts. New York, 1959, 
p.-283. 
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= H(u>) where k’ *= k - 1. 


OO 


Proof of (15): H*(l/u>) - S h *(1A>) = 2 h -k (w) = 2 \ (a)) = H(w) 

-OO -OO -OO 

oo OO 

Proof of (16): H(l) = 2 h (1) .= h (1) + 2 (h (1) + h ,(1)). 

K. O 1C “ K 

-00 1 

Property (10) for m=k, n=-k yields h^d) + h k (l) = 2Rg(p k ). 

Since H(l) is a real number, MH(1) := H(l). 

Proof of (17): H(p 1/2 ) = 2 h k (p 1/2 ) = 2 < h k+1 <P 1/2 > + k <p 1/2 )>- 

-°° 1 

Property (10) for m - k+1, n = -k yields h (p 1 ^ 2 ) + h (p 1//2 ) 

KtI •—K. 

OD . (2k+l)/2. 

= 2Rg(p ), a real number. 


Proof of (18): MH 2 (oj)= |h(w)[ 2 = Re 2 [H(w)] + Im 2 [H(co)]. 


The derivative of MH can be computed as 


MH'(co) = 


MH(w) 


Re[H(w)J . Re[H'(w)] f Im[H(w) . Im[H'(co)] 


Then MH' (1) ^ [ H(l) .Re[H'(1) 3 + o] . 


H' (1) -S 2 (1) = g’(l) + 2 (h^(l) + h^ k (l)) 

-oo k.=l 


= g'(1) + j S (2/p k )Ig'(p k ) , by Property (11) with m=k and n=-k, and 
k=l 


Re[H'(1)] = Rg' (1) = 0. 
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Proof of (19): 


H'(p 1/2 ) = Z 


I-(p 1/2 > = 


- —75 [H(p 1 / 2 ).Re[H'(p 1/2 >] + o] 

MH(p 1/2 )L J 


h k + l (pV2) + h -k ( P 1/2 > 


= J 2 (*/p lt+ 1 )Ig l [p^ 2 k+l)/ 2 )j^ yeal part of which ia zero. 
0 


e. Ripple and Notch Calculations for Three Assumed FormB of f(s) 

As was indicated previously, the "ripple" €, which is a measure of the 
deviation of the frequency response from a flat characteristic, is defined ms 

€ = 20 log M a ri.i . ffM . 1 .. I (4 

10 Min[|H(w)| ] 

and the "notch" attenuation is defined by 


a = 20 log 


H(l) 


10 


[H(l) - h 0 <l>] 


(4-7) 


When Q is sufficiently large, we can make the approximations. 


Max[ |H(u>){ J = H< 1 ) = 1+22 Rg(P ), 

k=l 


(4-8a> 


Min[|H(w)| M H(p 1/2 ) = 2 S Rg(p k+1/2 >. 

k =0 


(4-8b) 


Therefore, by substituting Equations (4-8a) and (4-8b) into Equations (4-6) and 
(4-7) we can derive the following formulas for £ and a : 




iS’W’SV**#- i»W£ ?*«»»*, 
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e = 20 iog 10 <; 


oo 

1 + 22 Rg(p k ) 

_k=l_ 

2 2 Rg(p k + V2 > 
k=0 


> decibels. 


(4-9) 


a = 20 log 


10 


1 + 


2 2 Rg(p ) 
k=l 


decibels. 


(4-10) 


Except for the first-order filter, if Q is not sufficiently large, the 

true value of € will be larger than that given by Equation (4-9) since the 

true maximum and minimum values of |H (co)| are liable to occur at other points 

1/2 

than co = 1 and to = p . Under these conditions wa only know that c cannot 

be less than the value given by this equation. However, correct values of € for 
small values of Q will be computed by a procedure outlined in Section 4.4. 

Values of € and Q given by Equations (4-9) and (4-10) were calculated using 
the IBM 7094 computer for various values of N and Q and for three forms of f(s). 
These are the Butterworth functions of orders one, two, and three. The magnitudes 
of these functions for s = ju are: 


Mf i (Jw) = ^f == T r ' 

Mi + 


Mf 2 (jco) 


1 + w* 


Mf.(jco) = - =■ = - 

3 Ml + u 6 


(4-lla) 


(4-llb) 


(4-llc) 


These correspond to the following complex filter functions; 


f 1 (s) = 


1 + s 


(4-12a) 
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f 2 (s) = 


1 +-\f2s + 


(4-12b) 


f 3 (s) = -i-- , 

1 t 2s + 2s 2 <- s 3 


(4-12c) 


whose real parts for s = jco are given by 


Rf 1 (w) = 


1 + w 


(4-13a) 


Rf 2 (w) = 


1 - of 
1 + 0> 4 


(4-13b) 


Rf 3 (cj) = 


1 - 2h> 

1 + w 6 


(4-13c) 


f . Numerical Evaluation of the Infinite Series 

Given the real part of f(j<o) and values of Q and N=log (p), the two series 

A 

to be evaluated are 


2 Rf[Q(p k - p" k )J and 2 Rf[Q(p k+iy2 - p^"^ 2 )] 

k=l k=0 


It will be clear from the discussion that follows that a finite sum approx¬ 
imation of the second series will be at least as accurate as that of the first 
series for the same number of terms. Therefore, only the first sum will be 
treated. 

Suppose that, for k greater or equal to some fixed integer K, the general 

term of a series a, satisfies the condition 
k 

a. , < r a, where r < 1. (4-14) 

kfl “ k 
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It follows that, for i, a non-negative integer, 

Vt 5 V 1 - <4 - 15) 

The error between the value of the infinite series S and the finite approximation 


to S, S , is 


1 1 

OO 

s - s. = 


1 kl 

i=k+l 1 


CO 

< S l a J 

i=k+l 


< 2 lair 

i=0 1 K1 


|\| = |\|a/d-)-D, 


which reduces to 

Error < 


- <1 


Kl 


k >> K. 


Then the fractional error is restricted by 

N 

^k 


? t? 1^1 1 lr ^ V 

• E< - ST Tl/r - -' 1) » k > K - 


(4-16) 


(4-17) 


Given that w(k) = Q(p k - p k ) = 2Qsinh[k* ln(p)] and a = Rf[w(k)], 


Condition (4-14) can be rewritten as 


1 < - < 
r “ 


which is equivalent to 


1/1 Rf 

u)(k+l)H 

1/| Rf 

w(k)]j 


, k > K. 


(4-18) 


0 < ln(i) < In 
r 


1 

— in 

I 

|Rf [u(k+l)]j 


|Rf[w(k)]| 


(4-19) 


By the Law of the Mean there exists a number z between k and k + 1 such that 



94 


d_ 

dx 


in 


( jRf [w(x) ]|) 


In 


X-Z 


fR£[w(k+l) 


ttI - in 


r i 


Rf[w(k)r 


(4-20) 


Hence, a condition which is sufficient to imply Condition (4-18) for x > K is 


-Rf'fw(x)] w'(x) 
Rf [co(x)T 


> t > 0. 


(4-21) 


Note that 

u' (x) = 2Q ln(p) cosh [(x ln(p) ] > 2Q ln(p) sinh [x ln(p) ], (4-22a) 

* v 

or 


u' (x) > w(x) ln(p) o (4-22b) 

Finally, since u is a monotonically increasing function. Condition (4-18) 
can be replaced by the criterion 


-Rf ' (to) co ln(p) 
Rf (w) 


> t > 0, w 


W > 0. 


(4-23) 


The constants need for the evaluation of the fractional error by Equation 
(4-17) are given by 

r = e t (4-24a) 


k > - 


NW 


N 


- 2Q In (2 ) 


In (2 ) 


sinh 


W 


2Q 


w _1 (W) . 


(4-24b) 


An analysis of the real parts of the first three Butterworth low pass 
filter functions given by Equations (4~13a,b,c), indicate that adequate values 
of r and K for the three cases are* 
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r i - r 2 *- 1/P ’ r 3 1/p2 > 

K x - K 3 - N/Q, K 2 » 1.4N/Q 

Values of ripple £ and notch attenuation a defined by Equations (4-9) and 
(4-10) were next computed such that the accuracy of the infinite sums involved 

were within 1 per cent of their correct values according to Criterion (4-17). 

Graphs of £ and a in decibels are given m Figures 4.2 through 4.7 for values 
of Q between 1 and 20 and N between 1 and 10 for the three filter types. Note 
that in each case for large values of N, the curves of a vs. Q (Figures 4.3, 

4.5, 4.7) tend to converge to limiting positions. 

4.4 Calculations for Finite Sets of Band Pass Filters 

In the case of the infinite filter set utilizing first-order filters, there 
is only one pole per interval of repetition of the transfer function. Therefore 
there can only be one maximum and one minimum value of the transfer function per 
interval. From the analysis given above, these critical values must occur at 
the band pass filter center frequencies and the geometric mean frequencies be¬ 
tween them. In this case, the absolute value of € gives the correct ripple value 
for all values of Q and N. However, for higher order filters the number of poles 
per interval is greater than one, and the possibility exists, for comparatively 
large values of N/Q, that other maximums and minimums will occur. Consequently, 
a digital computer program utilizing complex number arithmetic, was written to 
calculate several points per repetition interval of the composite transfer function 
of n filters in order to determine the values of true ripple for comparatively 

large values of N/Q. That is, the function 

n-1 

2 f[jQ(w/P - p /w) ] 
k^. 0 


(4-25) 





Figure 4.2 


Infinite Filter Set: Ripple € vs. Filter Q (Filter Density N 
is a Parameter), First Order Butterworth. 
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Figure 4.3 Infinite Filter Set: Ripple € vs. Notch Attenuation a 

(Filter Density N is a Parameter), First Order Butterworth. 
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Figure 4.4 Infinite Filter Set: Ripple € vs. Filter Q (Filter Density N 
is a Parameter), Second Order Butterworth. 
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Figure 4.5 Infinite Filter Set: Ripple e vs. Notch Attenuation a 

(Filter Density N is a Paraaeter), Second Order Butterworth. 
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Figure 4.6 Infinite Filter Set* Ripple € vs. Filter Q (Filter Density N 
is a Parameter), Third Order Butterworth. 
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Figure 4.7 Infinite Filter Set: Ripple € vs. Notch Attenuation n 

(Filter Density N is a Parameter), Third Order Butterworth. 


TOT 


















102 


was plotted for values of cj between 1/2 and 2 p n 1 for appropriate values 

1/N 

of n, N(where p - 2 ) and Q for the second and third-order cases. The pre¬ 

cision of the values derived in this manner, as applied to infinite filter sets 
of the same type, was inferred from the periodicity of the transfer function, 
which is optimum on the interval midway between oo = 1 and w = p° The curves 
of £ vs. Q and a vs. €. were actually derived by adjoining the data obtained 
by the infinite filter set analysis (solid curves) with those obtained from the 
analysis of sets with a large number of filters (dashed curves). 

4.5 A Practical Variable Formant Filter 

Because of cost considerations, the breadth of the conventional rack mount 

panel, and the width of available linear attenuators, the number of filters was 

restricted to 12 for the first prototype model constructed. Also, due to the 

cost of manufacturing higher order band pass filters (utilizing a corresponding 

larger number of components with increasingly stringent tolerances), the order 

of the filters under consideration was limited to three. A decision was made, 

based on subjective experience and from the review of available data on musical 

2 

instrument formants, that a filter density of 3 filters per octave would satisfy 
the requirements for selectivity and that the frequency range 189 cps to 2400 cps 
would cover the principal region of interest for musical formants. Also the 
ripple was specified not to exceed 3 db and the notch attenuation due to the 
infinite attenuation of a single mid-band filter was to be at least 15 db. 
According to the data represented by the curves given, the largest values of a 
obtainable under these conditions are 10.8, 10.1, and 22.9 db for the 1st, 2nd, 
and 3rd order Butterworth filters, respectively. Note that the second-order 
filters offer no improvement over the first order types. It was clear that the 
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third-order types were required. Since the minimum ripple possible for this 
type for N = 3 is .6 db (a critical point where equi-ripple response occurs), 
corresponding to a = 20.6 db (only slightly worse than 22.9 db), filters 

which achieve this characteristic having Q - 5.5 were chosen. The specifi¬ 

cations for the band pass filters for the Variable Formant Filter are: 

1. Response Shape: Maximally Flat (Butterworth) Third Order. 

2. Percentage Bandwidth- 18 per cent. 

3. Center Frequencies- 189, 238, 300, 378, 475, 600, 756, 953, 1200 
1510, 1905, and 2400 cps. 

The response characteristics of filters supplied by White Instrument 
Laboratories (Austin, Texas) were measured. The center frequencies were all 
within 3 per cent of the specified values, the percentage bandwidth varied 
from 17.2 percent to 18.3 percent and the attenuations at twice the center 
frequency were all within 4 db of 55 db, the predicted attenuation. 

The ripple of the summed response of the Variable Formant Filter, which 
was constructed using these band pass filters, was measured to be less than 
1 db and the notch attenuation due to the removal of a single filter at midband 
was measured to be approximately 21 db. 

Variable linear attenuators (Lumiten 678) were purchased from Fairchild 
Recording Equipment Corporation. The vertical position of a knob mounted on the 
front panel of one of these units corresponds to the attenuation of an input 
signal, as is indicated by a decibel scale drawn on the face of the front panel. 
The knob varies the brightness of a lamp by means of a variable resistor in 
series with a fixed voltage (6.3 v.)* The lamp, m turn, operates to vary the 
resistance of a light-sensitive photoresistor, which is the attenuating element 
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of the circuit. 

There are two principal advantages with respect to using this type of 
attenuator in musical applications. First, the fact that the attenuation of 
each filter output is indicated vertically means that formant contours set 
up visually according to the positions of the knobs will at least in a qua- 
litive sense correspond to the actual formant characteristic. Second, the 
use of a lamp and a light-controlled resistor to smooth out (by virtue of the 
sluggishness of their responses) the abrupt changes between resistor contacts 
prevents the introduction of "noise" into the signal output, which ordinarily 
occurs when the control knob is moved rapidly across a wire-wound resistor. 

Unfortunately, m the process of constructing the Variable Formant Filter, 
we also discovered several disadvantages with these attenuators. First, the 
actual measured attenuations did not closely coincide with the scales marked 
on the front panels because the characteristics of the lamps and photoresis¬ 
tors vary considerably from attenuator to attenuator. Second, the attenuation 
is very sensitive to changes m lamp voltage; hence, the lamp voltage must be 
held within .1 volt. Third, the lamps may burn out, causing total failure of 
the attenuator. Fourth, the response to an increase in attenuation is quite 
slow due to the finite time it takes a lamp to lose its brightness. Fifth, there 
is no infinite attenuation setting, the lowest setting possible is about 70 db. 

While experimenting with the Variable Formant Filter, we found that the 
maximum notch attenuation does not occur when a mid-band attenuator is set for 
zero gain. The greatest notch attenuation was measured to be 40 db when the 


filter attenuation was set to 21 db. 
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The present Variable Formant Filter *oes not cover the entire useful fre¬ 
quency range for electronic music applications. Frequencies below 189 cps. and 
above 2400 cps. are severely attenuated. In the near future, we expect to ex¬ 
pand this instrument in order to cover the entire range, 60 to 16,000 cps. Since 
the ear is not as selective in separating frequency bands m the lower and upper 
extremes of the audio spectrum, it may be sufficient to use filter sets with 
lower filter densities in these regions. Alternatively, filters with sharp- 
cutoff characteristics on the sides adjacent to the rest of the filter set and 
with variable "roll-off" characteristics on the other sides of their center fre¬ 
quencies might be used to fill out the high and low ends of the audio frequency 
spectrum. 

4.6 Computer Simulation of Variable Formant Filters 

The same program used to evaluate the ripple of a finite filter set can 
be used to simulate an adjustable formant filter. That is, the frequency response 
of this filter, specified by the filter density N, the total number of filters 
n, the individual complex filter transfer function [given by the low pass equiv¬ 
alent, f (s)], and the filter Q, is printed out for a specified set of filter 
attenuations in decibels. Figure 4.8 illustrates the results of an attempt to 

fit the decibel curve -20 log^o) with the frequency response of the variable 

3 2 

filter which was physically realized (N-=3, n=12, f(s) = l/‘(fc + 2s + 2s + 1), 

Q = 5.5). The filter attenuations for this case are given by a = 2(k-l) db. 

K 

The graph is actually a plot of the equation 
H(go) = 20 log 10 


S 10 (k_1)/10 f 
k=0 


(jQ[ 


w/2 


k/N k/N 


A]) 


(4-26) 


for .5 <! u) < 20. 
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Chapter 5 

FOURIER SERIES METHODS FOR ANALYSIS OF A TRANSIENT HARMONIC TONE 
AND A PROJECTED ANALOG SYSTEM FOR 
ANALYSIS AND RESYNTHESIS OF MUSICAL TONES 


5.1 Introduction 

In this chapter, two "complete" methods for representing a transient harmonic 

signal by a time-variant Fourier series are derived. In both cases the computation 

of the Fourier coefficients, which vary with time, can be divided into two steps: 

(1) Hetrodyning the signal with quadrature pairs of each harmonic of the signal 

fundamental frequency, and (2) averaging or low-pass filtering of the hetro- 

dyned components. Although Step (1) is unique, there is a considerable amount 

of freedom for specifying Step (2), the low pass filter operation. In Section 5.6, 

a third low pass operation is found to yield a more accurate analysis of the test 
_t/T 

signal U(t)e cos(k2nt) (where k is an integer) than the other two types which 
correspond to "complete" methods for analysis. 

Finally, a plan for an electronic device which could be used to analyze 
musical sounds and which is an analog of the general method of time-variant 
Fourier series calculation, fs presented. This system makes use of the ultra¬ 
sonic frequency techniques developed in designing the Harmonic Tone Generator. 

The outputs of the analyzer could be used as inputs to an expanded version of 
the Harmonic Tone Generator which would then resynthesize the analyzed signal. 

As we have already seen, a harmonic tone generator can be used to generate 
a large class of signals. A subclass of possible signals of this type would be 
classified by musical listeners as consisting of harmonic tones even though 
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these tones are, strictly speaking, nonperiodic. It is only important that the 
perception of harmonicity be the predominant element which determines the timbre 
of a tone. A more analytic designation for this type of tone would be "quasi- 
periodic signal”. Deviations from periodicity can be caused by changing funda¬ 
mental frequency or by modulations of the harmonic amplitudes and phases. [Note 
that phase time-variations according to the relation 0 k (t) = 0 k +'k[<^ (t)-0^ ] 
are most conveniently interpreted in terms of the equivalent fundamental fre¬ 
quency variation, f = $j(t)] . 

Besides the problem of generating synthetic harmonic tones with time- 
varying parameters, it is equally important ft> consider the problem of determining 
the parameter specifications for signals which originate as acoustical events 
and which are known to be substantially harmonic, i.e., quasi-periodic. These 
signals include the tones of most musical instruments, certain sounds of nature, 
and the vowel sounds of human speech. Actually,there is one problem with speech 
sounds as compared to many musical sounds, which is the fact that their analysis 
in terms of harmonic structure is made difficult by the highly transient be¬ 
havior of the fundamental frequency. It is much easier to measure the movements 
of vowel formant peaks by means of time-variant Fourier transform or parallel- 
filter analyzer systems. Usually, information about the phases of the frequency 
components is not retained, and the analysis method has no guarantee of "com¬ 
pleteness”. However, since, m the case of musical tones, pitch is a relatively 
constant parameter, it is feasible to take advantage of their harmonic character 
and analyze by time-variant Fourier series methods instead of the more tradi¬ 
tional time-variant Fourier transform or parallel filter approaches . 
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5.2 Fourier Series Representation of a Quasi-Periodic Signal 

The well-known theorem of Fourier, which allows the computation of the har¬ 
monic structure of a periodic signal from the wave-shape of one of its cycles, 
cfcn be extended to cover the cases of nonperiodic and quasi-periodic functions. 
For the case of a function g(t) which is periodic with period T (i.e., g(t + T) = 
g(t)) and satisfies certain other conditions'^the coefficients of the Fourier 
series are given by the following formulas: 


1 ../ 


t* + T/2 


t’ + T/2 


g(x)dx, a = - 


g(x) cos 


E 27Tkxl 

~J' 


t* - T/2 


t* - T/2 


(5-1) 


t’ + T/2 


J g(x) sin^^Jdx, 


— 1,2,3,... 


t' - T/2 


where t' is an arbitrary time value. Then, the function can be expressed for 
all time by the infinite Fourier series in the following manner: 


27Tkt . 

a cos —— + b. sin 
k T k 


[»] 


(5-2a) 


c, cos 
k 


2rrkt 

T~ " 


(6-2b) 


31. Churchill, R.V. : Fourier Series and Boundary Value Problems . McGraw-Hill, 
New York, 1941, p.70. 
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(5-2c) 


If g(t) is a nonperiodic signal it can still be represented by a Fourier 
series in an arbitrary interval (t.,t ). It is only necessary to construct 
another function f(t) such that f(t) = g(t) for ^ < t < and f(t + tg - t 1 > = 
f(t) to see that this is true. That is, f(t) is equal to g(t) in the interval 
(tj,t 2 ) and is the periodic continuation of g(t) outside this interval. Since 
f(t), a periodic function, has a Fourier series representation good for all time, 
it follows that g(t) also has a Fourier series representation in the interval 
(t 1 ,tg)—indeed, it is the same series. Mathematically this is expressed by 

00 

g(t) = f(t) = a + S 

k=] 

. t 1 + t 2 

where T = t^ - t^. In particular, for t = t = --- 


T 


a, cos _ , 

k i t j 


b, sin 
k 


27 Tkt 


< t < t. 


(5-3) 


g(t') 



(5-4) 


where the coefficients are given by Equation (5-1). However, it is clear that 



Ill 


any interval (t-,t 0 ) can be chosen to compute such a Fourier series and we are 
X 2 

free to choose a set of intervals such that t - t = T is a constant and 

a JL 

t + t 
1 2 

t* _ --- is variable, taking on any value of time. In this case the 

the coefficients given by Equation (5-1) depend on t' and Equation (5-4) can 
be rewritten 


g(t') = a o (t’) + 


00 

2 

k=l 


a k (t’) cos 



+ b fc (t') sin 



(5~5a) 


= a Q (f) + 


oo 

2 

k=l 


c k (t’) c 



- <t> k <t’) * 


(5-5b) 


f QO 

As long as g(t) is piecewise continuous, the coefficients are continuous 
functions of time. Further, if the function is initially nonperiodic but becomes 
periodic with time interval T after a certain period of time, we are guaranteed 
that the coefficients will become constants. In any case, because of the Fourier 
theorem, we know that this Fourier series with time-variant coefficients is a 
"complete" representation of the function g(t); that is to say, g(t) can be re¬ 
constructed perfectly from a knowledge of these coefficients. 

33 

A modification of Equation (5-1) was used by Luce to compute the instan¬ 
taneous amplitudes and phases of the individual harmonics of a number of musical 

instrument tones. He chose the set of intervals (t ,t ) to coincide with the 

X 2 


32. Reference 31, p.65, 

33. Reference 2, pp. 20-217. 
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successive fundamental periods of the transient harmonic signals being analyzed 
(i.e., (0,T), (T,2T),...). Therefore, only one computation per period was 

performed, and since the signals were nonperiodic, the coefficients were bound 
to change discretely at the end points of each fundamental period. In this case, 
the representation of a nonperiodic signal given by Equation (5-5) is not strictly 
correct. For at these end points the Fourier series will converge to [g(t') + 
g(t' - T) ]/2 instead of g(t'). On the other hand, Luce's results indicate that 
in many cases the amplitudes and phases follow a smooth curve with respect to 
time so that a continuous curve might be inferred from it. 

5.3 Frequency Domain Equivalent of Fourier Series Coefficient Calculation 

Although Equation (5-5) is a "complete" representation, one might ask the 
question: How meaningful are the coefficient functions, a (t), a (t), b (t), 

O K K 

and their relatives, c fc (t) and <j> k (t), for a given signal g(t)? I propose to 
sidestep this question by posing a simpler question: Given a function which can 
be expressed by the equation 


g(t) 


n 

2 

k=l 


c (t) cos 

Iv 


-2flkt 

T 


+ $ k (t) 


(5-6) 


where o (t) and (f> (t) have a restricted amount of change over any given period 

K K 

(t, t+T), what is the relation of the c (t)'s and <j> (t)'s computed by Equations 
(5-1) and (5-2c) to the original c k (t)'s and <j> k (t)'s? Although no simple answer 
to this question can be given, it is possible to show by means of a frequency 
domain equivalent of the time-varying Fourier series calculation method that a 
certain amount of information contained in an oiiginal envelope c^(t) will, 
because of its transient behavior, be missing in the corresponding computed 
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c,(t). Moreover, the graph of each c (t) will show a certain amount of ripple 


due to the transient behavior of any of the parameters including the original 


V«- 

Note that Equations (5-1) and (5-5) can be combined to yield the identity: 
t + T/2 f t + T/2 

g(t) =7 / g(x)dx + Z) < j| J g(x) cos (~jT~) dxJ cosj^|^) 


v r 

■i f 


t - T/2 


t + T/2 


+ If J g(x) sin(2~)<lx 

*- t - T/2 


sin 




(5-7a) 


The second and third terms of Equation (5-7a) can be combined using a 
trigonometric identity: 


t + T/2 


t + T/2 


g(t) 


■» / 


g(x)dx + 


t - T/2 


I = / 

k=l . J „ 


g(x) cos 


t -T/2 


(t - x)J dx, 


(5-7b) 


This equation can then be rewritten in terms of convolution notation: 


g(t) = g(t)* |- [U(t - T/2) - U (t + T/2)] 


+ S g<t)* r [U(t 
k=l 1 


- T/2) - U(t + T/2)] cos^y 


(5-7c) 


where * indicates convolution. 


The -filter analysis system, which is equivalent to this time domain 
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analysis-synthesis system, can be obtained by taking the Fourier transform of 

34 

both sides of ( 5 - 7 c) and making use of the convolution theorem: 


F [g(t>] = G(jw) = G(ju>) 


sin((JT/ 2 ) 

oJT/2 


00 

+ 2 G(jw) 

k=l 


sin[ (oj + 27 Tk/T)T /2 ] 
(w + 2?Tk/T)T/2 


sin[ (to - 27 TkA)T /2 ] 
(w - 2ffk/T)T/2 


(5-8) 


Hence, the amplitude c (t) of the kth harmonic appears at the output of the kth 
band pass filter (one of an infinite set of arithmetically spaced band pass 
filters) whose response is given by 


_ sin[ (qj + 2yk/T)T/2 sin[ (to - 2?rk/T)T/2] 

k UU)i = (to + 27Tk/T)T/2 (u - 27 Tk/T)T /2 

since by Equation (5-7a) the time domain output corresponding to that of the 
kth filter is of the form, 

a k (t) cos^|^)+ b k (t) sin(~^)= c k (t) cos £ - (|> k (t)j. 

The response of the kth filter is zero at all integral values other than 
k times the fundamental frequency, meaning that steady state harmbnics other 
than the kth will not affect its output. However, if the kth harmonic and its 
neighboring harmonic frequencies are transient in nature, their Fourier trans¬ 
forms will not be concentrated strictly in lines and filter "cross-talk" will 


34 1 Cheng, D.K.: Analysis of Linear Systems , Addison-WeSley, Reading, Mass., 
1959, p. 227. 
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occur. With regard to the output of filter k, this interchange will result in 
the loss ot some detail of the original c (t) plus the introduction of other 
frequencies which will appear as ripple on the graph of c k <t). 

This equivalent filter system is useful as a theoretical tool for estimating 
the nature of the Fourier series representation of transient signals. In a real 
filter system of this type it would be impossible to extract the envelopes of 
the filter outputs without introducing another averaging process. 

It is also useful to consider that the computation of the coefficients as 
defined by Equation (5-1) can be divided into two steps: (1) Hetrodyning to 
DC and (2) Averaging or low pass filtering. (Calculation of the amplitude and 
phase by Equation (5-2c) represents a third step in the computation which is 
not really necessary to define a harmonic toneO The hetrodyning operation is 
perfectly straight forward: It simply involves the analog multiplication of the 
signal g(t) by quadrature pairs of harmonics tuned to each desired harmonic fre¬ 
quency of this signal. On the other hand, the integral averaging operation given 
by Equation (5-1) is not necessarily the best possible since the rejection of 
ripple due to transient envelopes is not optimum, and a precise adjustment of 
T is required to reject this ripple adequately even in the steady state; this 
becomes even more of a problem if the fundamental frequency of the signal is 
not constant. 

5.4 Rectangular Filter Approach 

Another analysis procedure which gives a "complete" representation of a 
quasi-periodic signal can be derived from the partitioning of the Fourier trans¬ 
form (frequency domain representation) of the signal into equal bands such that 
each harmonic frequency lies precisely in the middle of a corresponding band. 
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The operation of partitioning is equivalent to passing the signal through a 
set of band pass filters of the following form: 


H o (jto) = 




1, M<| 


0 , elsewhere 

i, (k - I)H<m< (k + i)H 

0 , elsewhere 
k = 1, 2, 3, .. . 


(5-10) 


Since this set of filters completely spans the frequency domain, the signal 
transform can be expressed by the following identity: 


□o 


G(jto) = 2 (j<o)G(jW) 


(5-11) 


Then, taking the inverse Fourier transform of both sides we obtain: 


OO 


g(t) = 2 F _1 [H k (jW)]*g(t) 

k=0 


(5-12a) 


7T/T 

= g(t)* f e J t I? + s e(t)* ^ 


-71/T 


k=l 




/■ 

<k-i)f 


-(k-i)f 


jwt dw 

p . 4- 

27T 


a J w t 

27T 


-(k + i>f 


(5-12b) 


= g(t)*[^sin(|i)] + 2 g(t)*[| r sin(2£) C os(^)] 


00 

-/ 

-00 


g(x) 


sm[ 


k=l 

7T(t-x) I 
T J 


7Tt 


(5-12c) 


7T (t-x) 
T 


-dx + 


2 > / <5 - l2d) 

k=i J T- 


-oO 
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oo 




-oo 


7T(t-x) 

T 


OO 

dx + 2 

k=l 


if 


g(x) cos 


27 Tkx 


sin[2%^] 

7T(t-x) 


dx 


,2flrkt x 

cos (__) 


OO 

+ 2 
k=l 


| y g <*> sm(5f£) 


Sin[^] 

7T (t-x) 


dx 


U “OO 


al n(2?Si) 


(5-12e) 


Note that Equation (5-12e) is precisely the same form as (5-7a) except that the 

t + T/2 


averaging operation 


/ 


has been replaced by 


t - T/2 


/ 


-00 


-^( t -x) ' ' WhiCh 


corresponds to an ideal sharp-cutoff low pass filter. With this method of aver¬ 
aging, equal weighting is given to retaining the original envelope of the par¬ 
ticular harmonic of interest and to excluding the ripple caused by other harm 
monies which may be highly transient. Moreover^ in the steady-state portions of 
a signal which is to be harmonically represented,, the ripple is completely re¬ 
jected even if the fundamental period T is chosen incorrectly or is changing. 6 t 
a rate slow compared to 1/T and within a range small compared to 1/T. 

Two examples of averaging operations which yield "complete" representations of 
transient signals have been presented. There may exist other such integral 
operations since it is only necessary that the integral weight function w(t-x) 
(considering that in all cases the integral limits are from -oo to °o) must 
satisfy the identity 


00 

1 = £ F[w(t)] + § 2 

k=l 


F[w(t) cos (-" - kt )J. 


(5-13) 
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If a harmonic simulation of a quasi-periodic signal is desired, complete¬ 
ness is an important consideration since we cannot hope to resynthesize an 
original signal exactly by means of the a k (t) and b^(t) coefficients if some 
information about the signal is lost through their computation. However, it is 
also essential that the amplitudes and phases of harmonics as calculated by 
Equation (5-2c) be interpretable. In general, this means that the spectral content 
of these derived "parameter-s^.gnals" should be substantially massed at frequencies 
much lower than 1/T. 

5.5 Comparison of Three Fourier Series Representations of a Simple Test Signal 
As mentioned in a previous section, the first step in computing the 
Fourier series coefficiencs involves hetrodyning the signal with quadrature 
pairs of the corresponding harmonic frequencies. If we suppose that the signal 
has the form given by Equation (5-6) (except that the phases are constant), the 
computation of the coefficient a (t) first irvolves the analog multiplication 

tv 

of g(t) with 2cos( 27 rkt/T) . This gives the result: 

2g(t)cos(27TktA) = c k (t)cos(<j> k ) + t c lt _ ;L ( t ) + c k+1 (t)] cos(<(> k )cos(2jj-t/T) , 

+ [ Vi (t) ■ cos ((j> k )sin (277t/T) + . . . (5-14) 

The second step is to average, or equivalently, to low-pass-filter the 
hetrodyned signal. Ideally, this operation would extract the term c (t) cos(cj) ) 
and completely reject the other higher frequency terms. However, since the spec¬ 
tral characteristics of the harmonic envelopes are normally not band-limited, 
the Fourier transforms of the various terms of Equation (5-14) will overlap and 
no linear filter will be able to discriminate between them perfectly. 
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With regard to the choice of the optimum transfer characteristic for this 
low pass filter, we are presented with an infinite array of possibilities, and 
simultaneously we are confronted with the so-called acoustical "Uncertainty 

,,35 

Principle which limits the accuracy with which amplitude and phase infor¬ 
mation pertaining to narrow band transient signals can be extracted. In order 
to provide a realistic illustration of this problem, the responses of three 
different averaging operators, each of which have quite simple time and fre¬ 
quency domain representations, to the test signal 


s(t) = U(t)e cos(k27Tt), k = 0, 1 , 2, . . . 


(5-15) 


will be compared. For simplicity we set T equal to unity. The response to sums 
of inputs of this type is obtained by superposition. 

In the time domain, the three integral operators are defined by their 
responses to an arbitrary input g(t): 


t+i 

(a) J g(x)dx , 
t-h 

t+i 

(b) J g(x)cos 2 [j7T(t—x)]dx , 


„ (x) sin[7T(t-x) ] d 
gW TT(t-x) ' 


(5-16) 


These operators are equivalent to the following frequency domain filter response 
characteristics: 


sin (cj) _ 

fa)[l - (fa)/7T) 2 ] 


f i, M 5 it 

(c) ^ (5-17) 

1 0 , elsewhere. 


35. Kock, W.E. : "On the Principle of Uncertainty in Sound," J. Acoust, Soc, Am. 
7(1) . 56-58, 1935. 
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These filter response characteristics are shown graphically by Figure 5.1 (a), 

(b), (c), respectively. All three share the property that their response is 

zero for integral values of 2H. Type (a) corresponds to the most simple linear 

operation, integration over a single period. Type (b) corresponds to an inte- 

2 

gration over two periods with a cos weight function, featuring much reduced 
response to frequencies exceeding 27T radians per second. Type (c) requires a 
much more difficult time domain integration (t'he limits are infinite and the 
weight function is bounded only by C/x); however, the frequency response is 
perfectly flat below cu = H rps, whereas all frequencies above this value are 
rejected. 

The ideal but absolutely unobtainable time domain response to s(t) (Equa¬ 


tion (5-15)) is 


|U(t)e t ^" T , k = O 


r(t) = 


, k > 0. 


(5-18) 


By substituting s(x) for g(x) in Equation (5-16) it can be shown that the response 
of operators (a) and (b) (with t > , t > 1, respectively) to s(t) both have 

the form 


r(t) = [A(k,T)cos(k27Tt) + B(k,T)sin(k277t) ] e 


(5-19) 


where the coefficients A and B for the (a) and (b) cases are given by 


A (k,T) 
a 


2T S inh(l/2T) 

rriiwrfp- > 


B (k,T) 
a ’ 


-4k7TT 2 sinh(l/2T ) 
1 + (2kTr)2 


(5-20) 


A b (k,T) = 


Tsinh(l/T) 




i 


i 


(2klfT)2 ~ _i_+ ((2k+l)7TT)2 " 1 + ((2k-l)7TT^2 
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H C (6U) 
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(cps) 

Figure 5.1 Low Pass Filter Characterist 


Three Averaging Operators. 
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2WTt (k + I> ffT (k - \^ T 1 

B b (k,T) “ “ Tsinh(1/T) 1 + (2WTt“ 1 + ((2k+l)7lT)Ji " 1 + ((2k-l)1TT)iiJ 

Graphs of these coefficients for k = 0,1, and 2 are given in Figure 

5.2. These should be compared to the ideal case where A(k,T) is equal to unity 

for k = 0 but is zero for nonzero values of k, and B(k,'?) is always zero. These 

curves together with Equation (5-14) allow an estimation of the "cross-talk 
ripple" which is superimposed on the graph of the derived Fourier coefficient, 
assuming that the coefficients of Equation (5-6) are behaving in an exponen¬ 
tial manner. It should be remembered that the amplitude of a transient component 
to be rejected may be much greater than that of the DC component (after hetro- 
dyning) which is to be extracted. Therefore, it is important that the rejection 
of non-DC transient components be as strong as possible. In both cases, with 
respect to the first harmonic ripple components, B is the least attenuated term. 
However, the type (b) response offers at least a 3 to 1 or 10 db improvement in 

attenuation of the term over the type (a) response for all values of t. For 

the second harmonic (k = 2) and large values of t the type (b) response offers 
a 4.8 to 1 improvement, i.e., an improvement of 13.6 db. ' 

Operators (a) and (b) are both distinguished by finite integration limits. 
This is especially advantageous if the computation of the Fourier coefficients' 
is to be done by digital computer. However, in the case of analog computation 
it becomes feasible to use filters with sharp-cutoff characteristics. This 
situation is epitomized by the frequency characteristic of operator (c). One 
might intuitively feel that this type of response would most effectively dis¬ 
criminate between low frequency and high frequency transient components. In 
actuality, there are limitations on the dynamic response of operator (c) to 
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T 

Figure 5,2 Coefficients of Equation (5-20) vs. Exponential 
Time Constant t. (k is a Parameter) 
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rapid transients because of the slowly damped prediction and memory capabilities 
of the impulse response sin(7Tt)/JTt. 

The sharp-cutoff filter response to the test signal s(t) can be calculated 
by substitution of s(x) given in Equation (5-15) for g(x) in Equation (5-16c): 


oo 


r(t) = 


f , x sin[JT(t-x) ] T,. 

J s(x) ~ ( TT ) ~ dlt “ J s<t - X 

-oo -oo 




■ /• 


(t-x)/T r ._o sin(7Tx) , 

cos[k27T (t-x)J-^- dx 


•t/T 


cos(k27Tt)( ^ 7r tan -1 [ (2k+l)7TT] ~ <| tan _1 [ (2k-l)7TT ] 


v 

*/ 


x/T ,, rvn x sin(1Tx) , 
e cos(k27Tx)—^- dx 


+ sin(k2Jrt) (|| ln[(l + [(2k+l)fT]V(l + [(2k-l)7TTj 2 )J 
^ t . 

+ / c^sinCkOTx) dx) 

0 ' - 


(5-21) 


For k = 0, this response reduces to: 


r(t) = e 


-t/T 


tan _1 (7TT) + J 

A 


x/t sin (fix) dx 

a ■ ■— 1 


7Tx 




(5-22) 


Here again the form of Equation (5-21) is identical to that oi Equation 
(5-19) except that in this case the A and B coefficients are dependent on time 
as well as the parameters k and t. For example, when It is equal to zero, the 
integral term given in Equation (5-22) is not bounded with respect to increasing 
time, although r(t) is. The integral term introduces a rather disagreeable error 
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which has a maximum at every odd integer value of t and a minimum at every even 
integer value. In other words, there is a relatively strong ripple component 
with frequency to = IT (the frequency of sharp cutoff) which does not decay as 

-t/T. 

rapidly as the desired component e 

For this more complex situation, compared to that of operators (a) and (b), 
it is necessary to use some sort of average criterion for comparing the errors 
of extraction or rejection of the envelopes for different values of k and T. 

One convenient formulation is 


error = 


error = 



, k = 0: 


(5-23a) 


k 4 0 . 


(5-23b) 


Values of error for operator (c) were calculated with the University of Illinois 
IBM 7094 computer as formulated by Equation (5-23) and are shown plotted as a 
function of T for k = 0, 1, 2 in Figure 5.3. It is interesting that for k = 0, 
Equation (5-23a) as applied to the response of operators (a) and (b) to the test 
signal yields simply |A-1| as the error. 

In summary, it appears that neither the rectangular time domain response 
operator (a) nor the rectangular frequency domain response operator (c) is opti¬ 
mal for evaluating the envelopes of frequency components which are rapidly changing 
over a large dynamic range,which is*a reasonable description of parts of the en¬ 
velopes of most musical tones. The operator (b) with its modified weight function 
represents a compromise between these two which is clearly better than either. 



RMS ERROR 


.6 



- Figure 5.3 Rms. Error of Operation (c) Response to 

e _t ' T cos (k27Tt) vs. t. (k is a Parameter) 
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Of course, the "best operator response characteristic would depend on the par¬ 
ticular test signal chosen. Equation (5-15) was selected because it is typical 
of the decay and attack curves found in music [s(t) may be thought of as a 
"decay", whereas s(-t) can be considered to be an "attack"], and it is relatively 
easy to handle on an analytical basis. 

5.6 A Projected Analog System for Analysis and Resynthesis of Musical Tones 

It is possible to construct by modification of the Harmonic Tone Generator 
design concept an analog analyzer for the calculation of instantaneous amplitudes 
and phases. The computational process consists of three steps: (1) Hetrodyning 
the signal with two quadrature components of each harmonic frequency, (2) sep¬ 
arate low pass filtering of the hetrodyned components, and (3) computation of 
the amplitudes and phases (which amounts to the solution of right triangles). 
Figure 5.4 indicates how this might be done using a variable and fixed carrier 
frequency. Specifically, if the signal to be analyzed, s(t), has the form of 
Equation (5-6), the output of the electronic switch driven by the fixed frequency 
oscillator is 

S c. (t)cos@Li + S (t)J U + S s in( J 7r a ) co s (j27rf t) 1 

k=i k T k L j=i ^ c J 


= i Sc k (t)co S [M + y t )] + s Sc k (t)^^cos[^i + $ k (t)]cos(j2irf c t) 
k=l j = 1 k=l 


= i Sc'(t)cos® + l (t)J 
k=l k T k 


oo n 

+J 2 2 

J=1 k=l 


■ (t) S -— ^ -(co S [27 I(j f c + j)t 4 k (t)] + cos[27r(jf e -|)t -^(t)]^- 


(5-24) 


where a/f is the pulse duration, 
c 
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The band pass filter for extracting the jth harmonic should be designed 

to pass the frequencies jf - (j+l)/T to jf + (j+l)/T without distortion, i.e., 

c c 

the amplitude response should be flat and the phase response linear over this 

frequency range. However, frequencies below (n+l)/T and above (j+l)f^ - (n+l)/T 

should be rejected. Except for some distortion of the harmonics exceeding the 

jth which are not of interest, the output of the band pass filter within a 

constant factor is s (t)cos (2j7Tf t). The output of the 90° phase shift circuit 

c 

is s(t)sin(j27Tf t). The output of one of the electronic switches driven by the 
c 

variable frequency oscillator, which is set to f + 1/T, is 
s(t)cos(j2rrf c t) |j + S cos[i27T(f c + 1/T)] 


_ Sltl ^^ a ~ s(t)cos(j21Tt/T) + high frequency terms. (5-25) 

The variable cutoff low pass filter removes the high frequency terms and averages 
the hetrodyned low frequency component. 

The advantages of this method are that any fundamental frequency can be 
handled and the tone may be played directly into the analyzer without an inter¬ 
mediate tape recorder. Also, variable cutoff low pass filters are commercially 
available which would allow adjustment of the bandwidth for the best compromise 
between retaining the transient characteristics of the Fourier coefficients and 
rejecting ripple. The amplitude outputs could be used as inputs to the Harmonic 
Tone Generator, which would in turn resynthesize the original tone; the two units 
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connected together would function in effect as a musical tone vocoder. Because 
all harmonics are analyzed simultaneously, the relative phases of the harmonics 
are correct with respect to one another, although the stability of the absolute 
phase depends on the stability of the variable and fixed carrier frequencies. 

The obvious drawbacks of this system, of course, are its complexity and 
its cost. It requires separate fixed band pass filters, low pass filters, complex 
nonlinear networks, and a multi-channel chart recorder. The system is tremen¬ 
dously simplified if only one frequency at a time is analyzed or if the funda¬ 
mental frequency is fixed. For example, Figure 5.5 shows a system for measuring 
the amplitude and phase derivative of a particular frequency f (the absolute 
or relative phases are of no consequence since the analyses of different fre¬ 
quencies are not performed simultaneously). Note that some simplification was 
possible as compared to a single harmonic channel of the system in Figure 5.4. 

A time mark must be recorded on the tape immediately preceding the start of the 
tone to be analyzed in order to identify the time reference of the analysis. 

Figure 5.6 illustrates a system which analyzes at a standard fundamental 

frequency f . The band pass filter is fixed and its pass and rejection bands 
Si 

are optimized to pass one harmonic and reject the others. The filter bandwidth 
is very narrow compared to its center frequency; hence, the symmetry of the 
filter transfer response is arithmetic to a close approximation. The harmonic 
to be analyzed is selected by choosing the proper hetrodyne frequency. The 
amplitude of the output is the desired amplitude, and the phase derivative can 
be computed by means of a phase-detector and differentiator. Other fundamental 
frequencies can be accommodated by means of a variable speed tape recorder, 
although care must be taken to insure that proper equalization of the original 






















Figure 5.6 Single Fixed Frequency Transient Tone Analyzer. 
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frequencies is maintained, and that the speed is constant. 

The chart speeds for plotting of musical instrument transients must be 
quite fast--perhaps too fast for ink types. More accurate ink curves could be 
obtained by recording the data at high speed on an FM tape recorder and playirig 
back at reduced speed. Light-beam oscillographs are commercially available, 
but the graphs resulting from these units are generally inferior in quality to 
ink types, must be fixed by photographic processes, and are difficult to repro¬ 
duce. Another possible solution is continuous photography of oscilloscope screen 


representations of the transients. 
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Chapter 6 

A PERCEPTUAL CHARACTERISTIC OF HARMONIC TONES: THRESHOLD 
FOR "HEARING OUT" THE UPPERMOST PARTIAL 

6.1 Introduction 

When a person hears a harmonic tone m musical context he does not nor¬ 
mally separate out the individual frequencies which the tone comprises. Rather 
he hears the tone as a single entity or "Gestalt”. Nevertheless, it is a well- 
known fact that the listener can sometimes perceive the pitch of a harmonic 
as distinct from that of the tone itself if one or more of the following con¬ 
ditions are satisfied: (a) The harmonic tone has a relatively fixed pitch and 
amplitude structure; (b) the listener can form a proper mental image of the 
frequency he is to hear out in advance; (c) the harmonic in question is suf¬ 
ficiently intense relative to the rest of the tone (and especially relative 

to the partials adjacent to this harmonic). Limitations on this type of per- 

36 

ception were recently investigated by Plomp, Two of his conclusions are: 

(1) "... partials of a complex sound can be heard out' only if their frequency 
separation exceed the [so-called] critical bandwidth." Consequently, it is 
difficult to hear out harmonics above the ninth for tones with flat spectra. 

(2) ", . . the old conviction, that the third and fifth partials — m general 
the uneven ones — are easier distinguished than the even partials, is not con¬ 
firmed [by Plomp*s data]," (Several references to observers who do share this 
conviction are given m his article.) 

36, Plomp, R. . "The Ear as a Frequency Analyzer," J. Acoust, Soc. Am,, 36(9), 
1623-1636, 1964. 
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Plomp's results are based on a test which uses forced decision compar¬ 
isons of pure sine tones to harmonic tones consisting of the first 12 partials 
at equal intensity. The experiment described here uses an entirely different 
method to determine the relative amplitude of a partial necessary for it to be 
heard out, and, incidentally, whether or not the odd partials are easier to 
hear separately than the even ones. In the latter case, the data derived from 
this experiment shows agreement with Plomp’s conclusion. 

6.2 Experimental Method 

Fourteen listeners, who are highly trained musicians or have had consid¬ 
erable experience m sound perception, were tested separately to determine 
their individual limitations with respect to hearing out the individual partials 
of harmonic tones. At each of five fundamental frequencies 65, 130, 260, 520, 
and 1040 cps., a group of five tones was^presented to the listeners via a loud¬ 
speaker (Acoustics Research AR-3). The five tones consisted successively of the 
first two partials, the first three partials, and so on. Each partial of a 
tone except the highest was set at the same subjective intensity level (in 
phons) as measured by a sound level meter (B&K 2203). The levels were set by 
the appropriate attenuator controls on the Harmonic Tone Generator, which was 
used as the sound source in this experiment. Since the experimental laboratory 
was not soundproof, it was necessary to use playback levels which were both 
"comfortable" and at the same time sufficiently intense to allow the listener 
to ignore any background noise present. These requirements resulted in the 
choices 70 phons ar 65, 260, and 520 cps., 80 phons at 130 cps., and 60 phons 
at 1040 cps. 

Each listener was instructed to adjust the intensity of the highest har¬ 
monic of a given tone to the lowest level where without question he (or she) 
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could distinctly perceive the pitch of this harmonic as separate from that of 
the tone itself. This level was measured in acoustical decibels with the sound 
level meter and a wave analyzer (Hewlett-Packard 302A) tuned to the perceived 
harmonic frequency. Later the value was converted to subjective intensity units. 
The wave analyzer was used to insure that the actual acoustical intensity of 
each harmonic was being measured, thus eliminating errors which could be intro¬ 
duced by the effects of system intermodulation distortion and by distortion of 
the original harmonic components. The experimental set-up is given by Figure 
6.1. The sound level meter microphone was placed near the listener's left ear 
(each listener was instructed to stop his right ear) so that the reading 
would correspond as close as possible to the sound level at this ear. 

Errors in the measurements were bound to occur because of a number of 
factors such as the variation in the room background noise, the variation of 
the room standing-wave pattern with different physical positions of the listener 
and the experimenter (an attempt was made to keep this constant), the variation 
of the listener's interpretation of the experimenter's definition of the level 
at which a harmonic is perceived separately, subjective harmonics, and the 
possibility of confusing imagination with perception, as well as ordinary 
measurement errors. Standing waves would not have been a problem had earphones 
of sufficient quality been available. However, all of the earphones tested 
had a tendency toward introducing excessive distortion and sympathetic vibration 
sounds, and provided an auditory "closeness" which greatly increased the level 
of auditory fatigue. In an attempt to average out the effects of fatigue from 
the beginning to the end of the experiment, the order of the fundamental 
frequencies of the tones presented was rotated. 
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Figure 6.1 Test Configuration for the Psychoacoustic Experiment: 
"Hearing Out" the Uppermost Partial of a Harmonic Tone. 
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6.3 Calculations 

For each of the harmonic tones presented the levels of the individual 
constituent harmonics are set to equal subjective intensity levels m phons. 
Since sound level meter measurements are made in acoustical decibels r (db) 

it is necessary to translate from the specified level in phons to the level 

' 21 
in dbs by means of the Fletcher-Munson loudness curve. The loudness of a 

harmonic tone excepting the nth partial is calculated by the conventional 

rule of energy combination. That is, the total loudness is given in terms 

of the individual loudnesses which are all equal to by 

XI"" 1 

L = lOlog S[lO Ll/2 °] 2 = lOlog [(n-l)10 L I /10 ] 

k=l 

= Lj + 101og i() (n-l) (phons) (6-1) 

The threshold criterion of most interest to me was the difference between the 
loudness of the nth partial required for the listener to hear it out. This 
gives the relative subjective intensity threshold of the uppermost harmonic 
when this harmonic is between the second and the sixth for five fundamental 
frequencies. The data values so derived are plotted m Figure 6.2 for each of 
the five frequencies together with the mean threshold values and the average 
absolute deviations averaged over the 14 listeners. Each average absolute devi¬ 
ation was computed as the average of the absolute values of the differences 
between the corresponding data values and the mean value. 

6.4 Conclusions 

Because of the large spread of the data it is difficult to infer any 
strong general conclusion about the results other than that the average 
threshold value for hearing out the highest partial is about -20 phons, 



RELATIVE SUBJECTIVE INTENSITY (PHONS) 



• DATA VALUE 

o MEAN VALUE 

. MEAN VALUE ♦ AVERAGE 
ABSOLUTE DEVIATION 
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independent of frequency and harmonic number. The mean values show a drop of 
10 phons from the second to the sixth harmonic for the 65 cps. tones and 
show a non-monotonic increase of 7 phons for the 130 cps tones. The 260 cps. 
and 520 cps. tones show only slight increases of 2 5 and 4 phons For the 
1040 cps. tone the average threshold is a constant -26 phons for harmonics 
2 and 3 but shows a sharp increase for higher harmonics. Three listeners 
could not hear out one or more of the fourth, fifth, and sixth partials of the 
1040 cps. tones and the sixth partial of a 520 cps. tone under any available 
means for increasing the strength of these partials. This phenomenon of relative 
uncertainty in discriminating partials above the fourth for frequencies ex¬ 
ceeding 500 cps. is m agreement with the results of Plomp's investigation. 

Only in the case of the 130 cps fundamental are the mean threshold values 
of the odd harmonics lower than they would be if they followed the general 
trend of change with respect to harmonic number. With this exception none of 
the data indicates that the odd harmonics are easier to hear out than the even 
ones under steady state conditions. 

GL. 5 The Listeners 

Participants in this listening experiment were: 

1. Theodore Murname 
2 Royal MacDonald 

3. Bruno Nettl 

4. Barbara Shapiro 
5 David Schroer 
6 . Gary Grossman 


7. Rhoda Grossman 



141 


8 . Donald Andrus 

9. Neely Bruce » 

10. James Beauchamp 

11. Amy Larson 

12. Thomas Howell 

13. Julia Sentman 

14. Burt Levy 

None of the listeners experienced any difficulty in hearing out the 
partials, the intensity of which were under their own control. Some persons 
took much more time deliberating over the correct threshold values than 
others. The time for the complete testing of one listener ranged from 40 
minutes to 70 minutes. Most people experienced some auditory fatigue 
because of the long tone durations used in the test. Some listeners were 
extremely acute at certain frequencies, but radically less so at other fre¬ 
quencies. The average thresholds for individuals ranged from 9 phons to 


26 phons. 
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